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■p. .r^rr. wnMnT-oronS cv ^TTPWrF. TARRETTNG TW ^ftPYOTTC CELLS 
YJYJT, nF THE JWVENTION 

The invention relates to methods for targeting an 
exogenous polynucleotide or exogenous, complementary 
polynucleotide pair to a predetermined endogenous DNA target 

0 sequence in a eukaryotic cell by homologous pairing, 

particularly for altering an endogenous DNA sequence, such as a 
chromosomal DNA sequence, typically by targeted homologous 
recombination. In certain embodiments, the invention relates 
to methods for targeting an exogenous polynucleotide having a 

L5 linked chemical substituent to a predetermined endogenous DNA 
sequence in a metabolically active eukaryotic cell, generating 
a DNA sequence-specific targeting of one or more chemical 
substituents in an intact nucleus of a metabolically active 
eukaryotic cell, generally for purposes of altering a 

20 predetermined endogenous DNA sequence in the cell. The 

invention also relates to compositions that contain exogenous 
targeting polynucleotides, complementary pairs of exogenous 
targeting polynucleotides, chemical substituents of such 
polynucleotides, and recombinase proteins used in the methods 

25 of the invention. 

BACKGROUND 

Homologous recombination (or general recombination) 
is defined as the exchange of homologous segments anywhere 

30 along a length of two DNA molecules. An essential feature of 
general recombination is that the enzymes responsible for the 
recombination event can presumably use any pair of homologous 
sequences as substrates, although some types of sequence may be 
favored over others. Both genetic and cytological studies have 

35 indicated that such a crossing-over process occurs between 
pairs of homologous chromosomes during meiosis in higher 
organisms . 

Alternatively, in site-specific recombination, 
exchange occurs at a specific site, as in the integration of ■ 
40 phage X int the E. coli chromosome and the excision of X DNA 
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from it. site-specific recombination involves specific 
sequences of the phage DNA and bacterial DNA. Within these 
sequences there is only a sh rt stretch f homology necessary 
for the recombination event, but not sufficient for it. The 
5 enzymes involved in this event generally cannot recombine other 
pairs of homologous (or nonhomologous) sequences, but act 
specifically on the particular phage and bacterial sequences. 

Although both site-specific recombination and 
homologous recombination are useful mechanisms for genetic 
10 engineering of DNA sequences, targeted homologous recombination 
provides a basis for targeting and altering essentially any 
desired sequence in a duplex DNA molecule, such as targeting a 
DNA sequence in a chromosome for replacement by another 
sequence. Site-specific recombination has been proposed as one 
15 method to integrate transf ected DNA at chromosomal locations 
having specific recognition sites (O' Gorman at al. (1991) 
science 251 ; 1351; Onouchi et al. (1991) yiucleic Acids Res. 19: 
6373). Unfortunately, since this approach requires the 
presence of specific target sequences and recombinases, its 
20 utility for targeting recombination events at any particular 
chromosomal location is severely limited in comparison to 
targeted general recombination. 

For these reasons and others, targeted homologous 
recombination has been proposed for treating human genetic 
25 diseases. Human genetic diseases include: (1) classical human 
genetic diseases wherein a disease allele having a mutant 
genetic lesion is inherited from a parent (e.g., adenosine 
deaminase deficiency, sickle cell anemia, thalassemias), (2) 
complex genetic diseases like cancer, where the pathological 
30 state generally results from one or more specific inherited or 
acquired mutations, and (3) acquired genetic disease, such as 
an integrated provirus (e.g., hepatitis B virus). However, 
current methods of targeted homologous recombination are 
inefficient and produce desired homologous recombinants only 
35 rarely, necessitating complex cell selection schemes to 
identify and isolate correctly targeted recombinants. 

A primary step in homologous recombination is DNA 
strand exchange, which involves a pairing of a DNA duplex with 
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at 1 ast one DNA strand containing a complementary sequence to 
f rm an intermediate recombination structure containing 
heteroduplex DNA (SSS, Radding, CM. (1982) f i nn, "P'^v, Gej\^%, . 
li: 405;' U.S. . Patent 4,888,274). The heteroduplex DNA may tak 
5 several forms, including a triplex form wherein a single 
complementary strand invades the DNA duplex (Hsieh et al. 
(1990) r.pnes and nAvelonment 4.: 1951) and, when two 
complementary DNA strands pair with a DNA duplex, a classical 
Holliday recombination joint or chi structure (Holliday, R. 
10 (1964) Renet. Res. 5: 282) may form, or a double-D loop 

("Diagnostic Applications of Double-D Loop Formation" U.S. S.N 
07/755,462, filed 4 September 1991, which is incorporated 
herein by reference) . Once formed, a heteroduplex structure 
may be resolved by strand breakage and exchange, so that all or 
15 ' a portion of an invading DNA strand is spliced into a recipient 
DNA duplex, adding or replacing a segment of the recipient DNA 
duplex. Alternatively, a heteroduplex structure may result in 
gene conversion, wherein a sequence of an invading strand is 
transferred to a recipient DNA duplex by repair of mismatched 
20 bases using the invading strand as a template (figngs, 3rd Ed. 
(1987) Lewin, B., John Wiley, New York, NY; Lopez et al. (1987) 
MnniA.r. Aeids Res. 15: 5643) . Whether by the mechanism of 
breakage and rejoining or by the mechanism (s) of gene 
conversion, formation of heteroduplex DNA at homologously 
25 paired joints can serve to transfer genetic sequence 
information from one DNA molecule to another. 

The ability of homologous recombination (gene 
conversion and classical strand breakage/rejoining) to transfer 
genetic sequence information between DNA molecules makes 
30 targeted homologous recombination a powerful method in genetic 
engineering and gene manipulation. 

The ability of mammalian and human cells to 
incorporate exogenous genetic material into genes residing on 
chromosomes has demonstrated that these cells have the general 
35 enzymatic machinery for carrying out homologous recombination 
required between resident and introduced sequences. These 
targeted recombination vents can be us d to correct mutations 
at known sites, replace genes or gen segments with defective 
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ones, or introduce foreign genes into cells. The efficiency of 
such gene targeting techniques is related to several 
parameters: the efficiency of DNA delivery into cells, the type 
of DNA packaging (if any) and the size and conformation of the 
5 incoming DNA, the length and position of regions homologous to 
the target site (all these parameters also likely affect the 
ability of the incoming homologous DNA sequences to survive 
intracellular nuclease attack) , the efficiency of recombination 
at particular chromosomal sites and whether recombinant events 

10 are homologous or nonhomologous. Over the past 10 years or so, 
several methods have been developed to introduce DNA into 
mammalian cells: direct needle microinjection, transf ection, 
electroporation, retroviruses, adenoviruses, and other viral 
packaging and delivery systems, liposomes, and most recently 

15 techniques using DNA-coated microprojectiles delivered with a 
gene gun (called a biolistics device) , or narrow-becim lasers 
(laser-poration) . The processes associated with some types of 
gene transfer have been shown to be both mutagenic and 
carcinogenic (Beordwell, (1989) Mutagenesis ^:245) , and these 

20 possibilities must be considered in choosing a transfection 
approach. 

The choice of a particular DNA transfection procedure 
depends upon its availability to the researcher, the 
technic[ue*s efficiency with the particular chosen target cell 

25 type, and the researchers concerns about the potential for 

generating unwanted genome mutations. For example, retroviral 
integration requires dividing cells, most often results in 
nonhomologous recombination events, and retroviral insertion 
within a coding sequence of nonhomologous (i.e., non-targeted) 

30 gene could cause cell mutation by inactivating the gene's 

coding sequence (Friedmann, (1989) Science 244 ;1275K Newer 
retroviral-based DNA delivery systems are being developed using 
defective retroviruses. However, these disabled viruses must 
be packaged using helper systems, are often obtained at low 

35 titer, and recombination is still not site-specific, thus 

recombination between endogenous cellular retrovirus sequences 
and disabled virus sequences could still produce wild-type 
retrovirus capable of causing gene mutation. Adeno- or p lyoma 
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Virus based delivery systems appear very promising (Samulski et 
al., (1991) EMBO J. IQ; 3941; Gareis et al., (1991) fieU^. 
M»ile. Biol . 22: 191; Rosenfeld et al. (1992) Qsll &S.i 143) 
although they still require specific cell membrane recognition 
5 and binding characteristics for target cell entry. Liposomes 
often show a narrow spectrum of cell specificities, and when 
• DNA is coated externally on to them, the DNA is often sensitive 
to cellular nucleases. Newer polycationic lipospermines 
compounds exhibit broad cell ranges (Behr et al., (1989) EEfZS^ 
10 r ^^f h"-'^- S"^- tiSA M:6982) and DNA is coated by these 

compounds. In addition, a combination of neutral and cationic 
lipid has been shown to be highly efficient at transfection of 
animal cells and showed a broad spectrum of effectiveness in a 
variety of cell lines (Rose et al., (1991) p1 oTe<?t>n j.qu?s 
IS 'lfi:520). Galactosylated bis-acridine has also been described 
as a carrier for delivery of polynucleotides to liver cells 
(Haensler JL and Szoka FC (1992) , Abstract V211 in iT, Cellt 
Bioehem. Supplement 16F, April 3-16, 1992, incorporated herein 
by reference) . Electroporation also appears to be applicable 
20 to most cell types. The efficiency of this procedure for a 
specific gene is variable and can range from about one event 
per 3 X 10* transfected cells (Thomas and Capecchi, (1987) SsH 
51:503) to between one in lo'' and 10^ cells receiving the 
exogenous DNA (Koller and Smithies, (1989) ?rnr, T^^^t-I t Aeadt 
25 sej. m.s.A.) M: 8932) . Microinjection of exogenous DNA into 
the nucleus has been reported to result in a high frequency of 
stable transfected cells. Zimmer and Gruss (Zimmer and Gruss 
(1989) Nature 338 ; 150) have reported that for the mouse ho?^J-f t 
gene, 1 per 150 microinjected cells showed a stable homologous 
30 site specific alteration. 

Several methods have been developed to detect and/or 
select for targeted specific recombinants between vector DNA 
and the target homologous chromosomal sequence (gge, Capecchi, 
(1989) science 244:1288 for review). Cells which exhibit a 
35 specific phenotype after specific homologous recombination, 
such as occurs with alteration of the liEEi gene, can be 
obtained by direct selection on the appropriate growth medium. 
Alternatively, a selective marker sequ nee such as ngo can be 
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incorporated into a vector under promoter control, and 
successful transfection can be scored by selecting G418^ cells 
followed by PGR to determine whether neo is at the targeted 
site (Joyner et al., (1989) Nature 218:153). A positive- 
5 negative selection (PNS) procedxire using both neo and HSV-tJi 
genes iallows selection for transfectants and against non- 
homologous recombination events, and significantly enriched for 
desired disruption events at several different mouse genes 
(Mansotir et al., (1988) Nature 326:348). This procedure has 

10 the advantage that the method does not require that the 
targeted gene be transcribed. If the targeted gene is 
transcribed, a promoter-less marker gene can be incorporated 
into the targeting construct so that the gene becomes activated 
after homologous recombination with the target site (Jasin and 

15 Berg, (1988) Genes and Development 2.: 1353; Doetschman et al. 

(1988) Proc. Natl. Acad, Sci. fU.S.A,> 85 ; 8583; Dorini et al., 

(1989) Science 243 :1357; Itzhaki and Porter, (1991) Nucl, Acids 
Res. i9:3835). Recombinant products produced using vectors 
with selectable markers often continue to retain these meorkers 

20 as foreign genetic material at the site of transfection, 

although loss does occur. Valancius and Smithies (Valancius 
and Smithies, (1991) Molec. Cellular Biol. 11:1402) have 
recently described an "in-out" targeting procedure that allowed 
a subtle 4-bp insertion modification of a mouse hprt target 

25 gene. The resulting transfectant contained only the desired 
modified gene sequence and no selectable marker remained after 
the "out" recombination step. Cotransf ormation of cells with 
two different vectors, one vector containing a selectable gene 
and the other used for gene disruption, increases the 

30 efficiency of isolating a specific targeting reaction (Reid et 
al., (1991) Molec, Cellular Biol. 11:2769) among selected cells 
that are subsequently scored for stable recombinants. 

Unfortunately, exogenous sequences transferred into 
eukaryotic cells undergo homologous recombination with 

35 homologous endogenous sequences only at very low frequencies, 
and are so inefficiently recombined that large numbers of cells 
must be transfected, selected, and screened in order to 
generate a desired correctly targeted homologous recombinant 
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(Kucherlapati et al. (1984) Proc. Nat;, ftrf^rl Sri. (U.S.A. ) . 81: 
3153; smithies, 0.(1985) Nature 317; 230; Song et al. (1987) 
l>r•,^r■■ Watl . ^^^i - m.s.A.) 84: 6820; Doetschman et al. 

(1987) Nature 330; 576; Kim and Smithies (1988) W^rAe.%c ftci^s 
Res, li: 8887; Doetschman et al. (1988) pp,«?j.tt ; Koller and 
smithies (1989) on^situ; Shesely et al. (1991) ?roc, mit 
fi.,^ m.s.A.^ 88; 4294; Kim et al. (1991) ggna Ifll: 227, 

which are incorporated herein by reference) . 

Recently, Koller et al. (1991) ?ror.. NaUt ftc^d, ggj-t 
m.s.A.-> M: 10730 and Snouwaert et al. (1992) Ssisnss 252: 
1083, have described targeting of the mouse cystic fibrosis 
transmembrane regulator (CFTR) gene for the purpose of 
inactivating, rather than correcting, a murine CFTR allele. 
15 'Koller et al. employed a large (7.8kb) homology region in the 
targeting construct, but nonetheless reported a low frequency 
for correct targeting (only 1 of 2500 G4 18 -resistant cells were 
correctly targeted). Thus, even targeting constructs having 
long homology regions are inefficiently targeted. 
20 Several proteins or purified extracts having the 

property of promoting homologous recombination (i.e., 
recombinase activity) have been identified in prokaryotes and 
euJcaryotes (Cox and Lehman (1987) hT)n- ^^ev, ^^ocheTn. f6:229; 
Radding, CM. (1982) on-cit. ; Madiraju et al. (1988) £eoc^ 
25 Acad. ficA . ru.s.A.^ £S: 6592; ; McCarthy et al. (1988) 

Pr-ft». Watl. ar'-ad. Sei. m.s.A.^ £5: 5854; Lopez et al. (1987) 
op.cit. . which are incorporated herein by reference) . These 
general recombinases presumably promote one or more steps in 
the formation of homologous ly-paired intermediates, strand- 
30 exchange, gene conversion, and/or other steps in the process of 
homologous recombination. 

The frequency of homologous recombination in 
prokaryotes is significantly enhanced by the presence of 
recombinase activities. Several purified proteins catalzye 
35 homologous pairing and/or strand exchange in ^itro , including: 
E. coli recA protein, the T4 uvsX protein, and the reel protein 
from Ustilago- maydis. Recombinases, lik the recA protein of 
E. coli are proteins which promote strand pairing and exchange. 
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The most studied recombinase to date has been the recA 
recombinase of E. coli, which is involved in h mology search 
and strand exchange react i ns (gee, Cox and Lehman (1987) 
op.eit. ^ . RecA is required for induction of the SOS repair 
5 response, DNA repair, and efficient genetic recombination in E. 
coli. RecA can catalyze homologous pairing of a linear duplex 
DNA and a homologous single strand DNA in vitss. In contrast 
to site-specific recombinases , proteins like recA which are 
involved in general recombination recognize and promote pairing 
10 of DNA structures on the basis of shared homology, as has been 
shown by several in vitro experiments (Hsieh and Camerini-Otero 
(1989) .T. Biol. Chem. 264: 5089; Howard-Flanders et al. (1984) 
Nature 309 ; 215; Stasiak et al. (1984) Cold Sprjng Harbor Symp.- 
ouant. Biol. 42: 561; Register et al. (1987) J. B^ol. Chem, 
15 262 : 12812) . Several investigators have used recA protein in 
vitro to promote homologously paired triplex or other three- 
stranded joint DNA molecules (Cheng et al. (1988) J- 
chem. 263 : 15110; Ferrin and camerini-Otero (1991) gc^ei^pe 2M: 
1494; Ramdas et al. (1989) .t. Biol, chem. 2M: 17395; Strobel 
et al. (1991) Science 254 ; 1639: Hsieh et al. (1990) op.c?,^, ; 
Rigas et al. (1986) Pt-oc. Natl. Aead. Sei.. fU.S.A.) fil: 9591; 
and Camerini-Otero et al. U.S. 7,611,268 (available from 
Derwent) , which are incorporated herein by reference) . 
Unfortunately, many important genetic engineering manipulations 
25 involving homologous recombination, such as using homologous 
recombination to alter endogenous DNA sequences in a living 
cell, cannot be done in vitro . Further, gene therapy requires 
highly efficient homologous recombination of targeting vectors 
with predetermined endogenous target sequences, since 
30 selectable marker selection schemes such as those currently 
available in the art are not usually practicable in human 
beings. 

Thus, there exists a need in the art for methods of 
efficiently altering predetermined endogenous genetic sequences 
35 by homologous pairing and homologous recombination in x4vo by 
introducing one or more exogenous targeting polynucleotide (s) 
that efficiently and specifically homologously pair with a 
predetermined endogenous DNA sequence. There exists a need in 



20 
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the art for high-efficiency gene targeting, so that complex in 
vitro selection prot cols (e.g., neo gene selection with G418) 
which are of limited utility for in Vivp gene therapy on 
affected individuals, are avoided. 

SUMMARY Q V THB TMVEWTION 

It is an object of the present invention to provide 
methods for targeting an exogenous polynucleotide to a 
predetermined endogenous DNA target sequence in a eukaryotic 
cell with high efficiency and with sequence specificity. 
Exogenous polynucleotides, are localized (or targeted) to one 
or more predetermined DNA target sequence (s) by homologous 
pairing in vivo . Such targeted homologous pairing of exogenous 
polynucleotides to endogenous DNA sequences in Zivo nay be 
"used: (1) to target chemical substituents in a sequence- 
specific manner in vivo . (2) to correct or to generate genetic 
mutations in endogenous DNA sequences by homologous 
recombination and/ or gene conversion, (3) to produce 
homologously targeted transgenic animals at high efficiency, 
and (4) in other applications (e.g., targeted drug delivery) 
based on in vivo homologous pairing. Some embodiments of the 
invention employ targeted exogenous polynucleotides to correct 
endogenous mutant gene alleles in human cells; the invention 
provides methods and compositions for correcting disease 
alleles involved in producing human genetic diseases, such as 
inherited genetic diseases (e.g., cystic fibrosis) and 
neoplasia (e.g., neoplasms induced by somatic mutation of an 
oncogene or tumor suppressor gene, such as p53, or viral genes 
associated with diease or neoplasia, such as HBV genes) . 

In one embodiment, at least one exogenous 
polynucleotide is targeted to a predetermined endogenous DNA 
sequence and alters the endogenous DNA sequence, such as a 
chromosomal DNA sequence, typically by targeted homologous 
recombination within and/ or flanking the predetermined 
endogenous DNA sequence. Generally, two complementary 
exogenous polynucleotides are used for targeting an endogenous 
DNA sequence.' Typically, the targ ting polynucl otid (s) are 
introduced simultaneously or contemporaneously with one or more 
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recombinase species. Alternatively, one or more recombinase 
species may be produced in vivo by expression of a heterologous 
expression cassette in a cell containing the preselected target 
DNA sequence. 

It is another object of the invention to provide 
methods whereby at least one exogenous polynucleotide 
containing a chemical substituent can be targeted to a 
predetermined endogenous DNA sequence in a met'abolically-active 
eukaryotic cell, permitting sequence-specific targeting of 
chemical substituents such as, for example: cross-linking 
agents, metal chelates (e.g., iron/EDTA chelate for iron 
catalyzed cleavage), topoisomerases , endonucleases , 
exonucleases, ligases, phosphodiesterases, photodynamic 
porphyrins, free-radical generating drugs, quinones, 
chemotherapeutic drugs (e.g., adriamycin, doxirubicin) , 
intercalating agents, base-modification agents, immunoglobulin 
chains, oligonucleotides, and other substituents. The methods 
of the invention can be used to target such a chemical 
substituent to a predetermined DNA sequence by homologous 
pairing for various applications, for example: producing 
sequence-specific strand scission (s) , producing sequence- 
specific chemical modifications (e.g., base methylation, strand 
cross-linking) , producing sequence-specific localization of 
polypeptides (e.g. , topoisomerases, helicases, proteases), 
producing sequence-specific localization of polynucleotides 
(e.g., loading sites for replication factors, transcription 
factors and/or RNA or DNA polymerases), and other applications. 

It is another object of the present invention to 
provide methods for correcting a genetic mutation in an 
endogenous DNA target sequence, such as a sequence encoding a 
protein. For example, the invention can be used to correct 
genetic mutations, such as base substitutions, additions, 
and/ or deletions, by converting a mutant DNA sequence that 
encodes a non-functional, dysfunctional , and/ or truncated 
polypeptide into a corrected DNA sequence that encodes a 
functional polypeptide (e.g., has a biological activity such as 
an enzymatic activity, hormone function, or other biological 
property). The methods and compositions of the invention may 
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also be used to correct genetic mutati ns or dysfunctional 
alleles with genetic lesi ns in non-coding sequences (e.g. , 
promoters, enhancers, silencers, origins of replication, 
splicing signals) . In contradistinction, the invention also 
can be used to target DNA sequences for inactivating gene ^ 
expression; a targeting polynucleotide can be employed to make 
a targeted base substitution, addition, and/or deletion in a 
structural or regulatory endogenous DNA sequence to alter 
expression of one or more genes, typically by knocking out at 
, least one allele of a gene (i.e., making a mutant, 

nonfunctional allele) . The invention can also be used to 
correct disease alleles, such as a human or non-human animal 
CFTR gene allele associated with cystic fibrosis, by producing- 
a targeted alteration in the disease allele to correct a 
5 "disease-causing lesion (e.g., a deletion). 

It is a further object of the invention to provide 
methods and compositions for high-efficiency gene targeting of 
human genetic disease alleles, such as a CTTR allele associate! 
with cystic fibrosis or an LDL receptor allele assocaited with 
0 familial hypercholesterolemia. In one aspect of the invention, 
targeting polynucleotides having at least, one associated 
recombinase are targeted to cells An siSS (i-e-. in an intact 
animal) by exploiting the advantages of a receptor-mediated 
uptake mechanism, such as an asialoglycoprotein receptor- 
25 mediated uptake process. In this variation, a targeting 

polynucleotide is associated with a recombinase and a cell- 
uptake component which enhances the uptake of the targeting 
polynucleotide-recombinase into cells of at least one cell type 
in an intact individual. For example, but not limitation, a 
30 cell-uptake component typically consists essentially of: (1) a 
galactose-terminal (asialo-) glycoprotein (e.g., 
asialoorosomucoid) capable of being recognized and internalized 
by specialized receptors (asialoglycoprotein receptors) on 
hepatocytes in vivo, and (2) a polycation, such as poly-L- 
35 lysine, which binds to the targeting polynucleotide, usually by 
electrostatic interaction. Typically, the targeting 
polynucleotide is coated with recombinase and cell-uptake 
component simultaneously so that both recombinase and cell- 
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uptake component bind t the targeting polynucleotide; 
alternatively, a targeting p lynucleotide can be coated with 
recombinase prior t incubation with a cell-uptake component; 
alternatively the targeting polynucleotide can be coated with 
5 the cell-uptake component and introduced into cells 

comtemporaneously with a separately delivered recombinase 
(e.g., by targeted liposomes containing one or more 
recombinase) . 

The invention also provides methods and compositions 

10 for treatment and prophylaxis of genetic diseases of animals, 
particularly mammals, wherein a recombinase and a targeting 
polynucleotide are used to produce a targeted sequence 
modification in a disease allele of an endogenous gene. The 
invention may also be used to produce targeted sequence 

15 modification (s) in a non-human animal, particularly a non-humem 
maimmal such as a mouse or rabbit, which create (s) a disease 
allele in a non-human animal. Sequence-modified non-h\man 
animals harboring such a disease allele may provide useful 
models of human and veterinary disease (s). Alternatively, the 

20 methods and compositions of the invention can be used to 

provide non-human animals having homologously-targeted hviman 
disease alleles integrated into a non-human genome; such non- 
human animals may provide useful experimental models of humem 
genetic disease, including neoplastic diseases and 

25 atherosclerotic disease. 

It is also an object of the invention to provide 
methods and compositions for recombinase-enhanced positioning 
of a targeting polynucleotide to a homologous sequence in an 
endogenous chromosome to form a stable multistrand complex, and 

30 thereby alter expression of a predetermined gene secjuence by 
interfering with transcription of sequence (s) adjacent to th 
multistrand complex. Recombinase (s) are used to ensure correct 
homologous pairing and formation of a stable multistrand 
complex, which may include a double-D loop structure. For 

35 example, a targeting polynucleotide coated with a recombinase 
may homol gously pair with an endogenous chromosomal sequence 
in a structural or regulatory sequence of a gene and form a 
stable multistrand complex which may: (1) constitute a 
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significant physical or chemical bstacle to formation of or 
pr cession of an active transcriptional complex comprising at 
least an RNA polymerase, or (2) alter the local chromatin 
structiire so as to alter the transcription rate of gene 
5 sequences within about 1 to 500 kilobases of the multistrand 
complex. 

It is another object of the invention to provide 
methods and compositions for treating or preventing human and 
animal diseases, particularly viral diseases, such as hxman 

10 hepatitis B virus (HBV) hepatitis, by targeting viral gene 
sequences with a recombinase-associated targeting 
polynucleotide and thereby inactivating said viral gene 
sequences and inhibiting viral-induced pathology. 

It is a further object of the invention to provide 

15 compositions that contain exogenous targeting polynucleotides, 
complementary pairs of targeting polynucleotides, chemical 
substituents of such polynucleotides, and recombinase proteins 
used in the methods of the invention. Such compositions may 
include cell-uptzJce components to facilitate intracellulsir 

20 uptake of a targeting polynucleotide, especially for e^ vivo 
and in vivo gene therapy, transgenesis, and gene modification. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figs. lA-lD. Homologous targeting of EgcA-coated 

25 chromosome 1 alpha-satellite polynucleotides in living cell 

nuclei. The homologously targeted biotinylated polynucleotides 
were visualized by addition of FITC-avidin followed by washing 
to remove unbound FITC. Signals were visualized using a Zeiss 
Confocal Laser Scanning Microscope (CLSM-10) with 488 nm argon 

30 laser beam illxamination for FITC-DNA detection. Fig. lA - 

localized FITC-DNA signals in cell nucleus. Fig IB - enhanced 
image of FITC-DNA signals in cell nucleus. Fig. IC - image of 
FITC-DNA signals overlaid on phase image of nucleus. Fig. ID - 
phase image of center of cell nucleus showing nucleoli. Note: 

35 all images except lower right were photographed at same focus 
level (focus unchanged between these photos) • 

Figs. 2A-2C. Homologous targeting of Eec^-coat d 
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chromosome 1 alpha-satellite polynucleotides in living cell 
nuclei. Fig. 2A - flu rescent image of FITC-DNA signals in 
cell nucleus. Fig. 2B - enhanced image of FITC-DNA signal in 
cell nucleus. Fig. 2C - overlay of FITC-DNA signals on phase 
image of nucleus. 

Figs. 3A-3B. Decondensed DNA from a targeted human 
chromosome 1 in a living cell nucleus displaying repeated 
alpha-satellite DNA sequences as visualized by FITC-DNA 
labeling; Fig. 3 A - short exposure; Fig. 3B- long exposure. 

Fig. 4. Human p53 tumor suppressor gene targeting in 
living HEp-2 cell nuclei. 

Figs. 5A and 5B. FITC-localization of EgSA-coated 
polynucleotides targeted to human chromosome 1 alpha-satellite 
sequences in a living cell nucleus. Fig. 5A - image of 
enhanced FITC-signals. Fig. 5B - overlay of FITC-signals on 
phase contrast image of cell nucleus. 

Fig. 6. Map of mammalian expression lacZ mutant 
plasmid pMCllacXpa. 

Fig. 7. Kap of mammalian expression lacZ wild-type 
plasmid pMCllacpa. 

Fig. 8. Multiple cloning site of plasmid pIBI30. 

Fig. 9. PGR products and primers from lacZ gene 

sequence. . 

Fig. lOA. Southern hybridization analysis of the 
687-bp fragment amplified from genomic DNA. Electrophoretic 
migration of a 687-bp DNA fragment generated with primers CFl 
and CF6 from genomic DNA of ZCFTE29o-cells which were capillary 
needle-microinjected with the 491-nucleotide fragment in the 
presence of recA (lane 2) or transfected as a protein-DNA-lipid 
complex where the 491-nucleotide fragments were coated with 
recA (+; lane 3). The control DNA was amplified from 
nontransfected i:CFTE29o-cult\ires (lane 1) . 

Fig. lOB Autoradiographic analysis of DNA transferred 
to Gene Screen Plus filters and hybridized with a ^^p-iabeled 
oligonucleotide specific for normal exon 10 sequences in the 
region of the aF508 mutation. Cells transfected by 
microinjection or protein-lipid-DNA complexes both were 
positive for homologous targeting, whereas control cells were 
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not* 

Fig. IIA. Analysis of DNA from cells electrop rated 
or transfected with DNA encapsulated in a protein-lipid 
complex. Allele-specific PGR amplification of the 687/684 bp 
DNA fragment amplified in the first round with primers CFl and 
oligo N (N) or oligo aF (aF) . Ethidium bromide-stained 300 bp 
DNA fragment separated by electrophoresis in a 1% agarose gel. 
The DNA in each lane is as follows: lane 1, 100-bp marker DNA; 
lane 2, control 16HBE14o-cell DNA amplified with the CFl/N 
primer pair; lane 3, nontransf ected 2:CFTE29o-cell DNA amplified 
with CFl/N primers; lane 4, nontransf ected ZCFTE29o-cell DNA 
amplified with CFl /aF primers; lane 5, DNA from SCFTE29o-cells . 
electroporated with recA-coated 491-nucleotide fragments and 
amplified with CFl/N primers; lane 6, DNA from ECFTE29o-cells 
transfected with recA-coated 491-nucleotide fragment 
encapsulated in a protein-lipid complex and amplified with 
CFl/N. 

Fig. IIB Autoradiographic analysis of the DNA in Fig. 
IIA transferred to Gene Screen Plus filters and hybridized with 
^2p-labeled oligo N probe. Samples in lanes 1-5 for the 
autoradiographic analysis are equivalent to lanes 2-6 in Fig. 
IIA. 

Fig. 12. PCR analysis of 2CFTE29o-genomic DNA 
reconstructed with the addition of 2 x 10^ copies of recA- 
coated 491-nucleotide fragments per microgram of genomic DNA. 
This ntimber of DNA fragments represents the total niomber of DNA 
copies microinj ected into cells and tests whether the 491- 
nucleotide fragment can act as a primer for the 687/684-bp 
fragment amplification. DNA was amplified as described in Fig. 
lOA. When the second round of amplification was conducted with 
CFl and the oligo N primers (lane 2) , the 300-bp DNA band was 
not detected when aliquots of the amplification reaction were 
separated electrophoretically. Amplification of the 2CFTE290- 
/491 bp DNA fragment with the CFl/oligo aF primer pair produced 
a 299-bp DNA product (lane 1) • Marker DNA is in lane 3. 

Fig. 13 graphically portrays the relationship between 
the number of" targeting polynucleotides introduced into cells 
and the frequency of gene correction using targeting 
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polynucleotides coated with recA or without recA. The 
indicated plasmids were microinjected int living cells at 
differing concentrations. Th average ntmber of plasmid 
molecules injected per cell was calculated from the 
5 concentration of plasmid in the fixed volume (approximately 0.5 
picolitre) of DNA solution injected per cell. Symbology used 
is: X = wild-type plasmid; O = mutant plasmid pMCllacXpa + 
wild-type 276-mer DNA coated with RecA protein; □ = mutant 
plasmid pMCllacXpa + wild-type 276-mer DNA without RecA 
10 protein. 

DEFINITIONS 

Unless defined otherwise, all technical and 
scientific terms used herein have the same meaning as commonly 

15 understood by one of ordinary skill in the art to which this 
invention belongs. Although any methods and materials similar 
or equivalent to those described herein can be used in the 
practice or testing of the present invention, the preferred 
methods and materials are described. For pxirposes of the 

20 present invention, the following terms are defined below. 

As used herein, the twenty conventional amino acids 
and their abbreviations follow conventional usage f Immunolocrv - 
A Synthesis . 2nd Edition, E.S. Golub and D.R. Green, Eds., 
Sinauer Associates, Sunderland, Massachusetts (1991) , which is 

25 incorporated herein by reference) . 

As used herein, the terms "predetermined endogenous 
DNA sequence" and "predetermined target sequence" refer to 
polynucleotide sequences contained in a eukaryotic cell. Such 
sec[uences include, for example, chromosomal sequences (e.g., 

30 structural genes, promoters, enhancers, recombinatorial 

hot spots, repeat sequences, integrated proviral sequences), 
episomal sequences (e.g., replicable plasmids or viral 
replication intermediates) , chloroplast and mitochondrial DNA 
sequences. By "predetermined" it is meant that the target 

35 sequence may be selected at the discretion of the practitioner 
on the basis of known or predicted sequence information, and is 
not constrained to specific sites recognized by certain. site- 
specific recombinases (e.g., FLP rec mbinase or CRE 
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'recombinase) . In some embodiments, the predetermined 
endogenous DNA target sequence will be other than a naturally 
occurring germline DNA sequence (e.g., a transgene, parasitic, 
or mycoplasmal or viral sequence) . An exogenous polynucleotide 
is a polynucleotide which is transferred into a eukaryotic cell 
but which has not been replicated in that host cell; for 
example, a virus genome polynucleotide that enters a cell by 
fusion of a virion to the cell is an exogenous polynucleotide, 
however, replicated copies of the viral polynucleotide 
subsequently made in the infected cell are endogenous sequences 
(and may, for example, become integrated into a cell 
chromosome) . Similarly, transgenes which are microinjected or 
transfected into a cell are exogenous polynucleotides, however 
integrated and replicated copies of the transgene (s) are 

' endogenous sequences . 

The term "corresponds to" is used herein to mean that 
a polynucleotide sequence is homologous (i.e., is identical, 
not strictly evolutionarily related) to all or a portion of a 
reference polynucleotide sequence, or that a polypeptide 
sequence is identical to a reference polypeptide sequence. In 
contradistinction, the term "complementary to" is used herein 
to mean that the complementary sequence is homologous to all or 
a portion of a reference polynucleotide sequence. For 
illustration, the nucleotide sequence "TATAC" corresponds to a 
reference sequence "TATAC" and is complementary to a reference 

sequence "GTATA". 

The following terms are used to describe the sequence 
relationships between two or more polynucleotides: "reference 
sequence", "comparison window", "sequence identity", 
"percentage of sequence identity", and "substantial identity". 
A "reference sequence" is a defined sequence used as a basis 
for a sequence comparisibn; a reference sequence may be a 
subset of a larger sequence, for example, as a segment of a 
full-length cDNA or gene sequence given in a sequence listing, 
such as a human cystic fibrosis gene (Riordan et al. (1989) 
science 2A5; 1066; Rommens et al. (1989) ^pjepce 1059), or 

may comprise a complete full-1 ngth cDNA or gene sequence. 
Generally, a reference sequence is at least 18 nucleotides in 
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length, typically at least 25 nucleotides in length, frequently 
at least 50 nucleotides in length. Since two polynucleotides 
may each (1) comprise a sequence (i.e., a portion of the 
complete polynucleotide sequence) that is similar between the 
5 two polynucleotides, and (2) may further comprise a sequence 
that is divergent between the two polynucleotides, sequence 
comparisons between two (or more) polynucleotides are typically 
performed by comparing sequences of the two polynucleotides 
over a "comparison window" to identify and compeure local 

10 regions of sequence similarity. A "comparison window", as used 
herein, refers to a conceptual segment of at least 18 
contiguous nucleotide positions wherein a polynucleotide 
sequence may be compared to a reference sequence of at least 18 
contiguous nucleotides and wherein the portion of the 

15 polynucleotide sequence in the compsurison window may comprise 
additions or deletions (i.e., gaps) of 20 percent or less as 
compared to the reference sequence (which does not comprise 
additions or deletions) for optimal alignment of the two 
sequences. Optimal alignment of sequences for aligning a 

20 comparison window may be conducted by the local homology 

algorithm of Smith and Waterman (1981) Adv. AdpI . Math. 2: 482, 
by the homology alignment algorithm of Needleman and Wunsch 
(1970) J. Mol, Biol. 48; 443, by the search for similarity 
method of Pearson and Lipman (1988) Proc. Na tl. Acad. Sci. 

25 fU.S.A, ) 85 ; 2444, by computerized implementations of these 
algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group, 575 Science 
Dr., Madison, WI) , or by inspection, and the best alignment 
(i.e., resulting in the highest percentage of homology over the 

30 comparison window) generated by the various methods is 
selected. The term "sequence identity" means that two 
polynucleotide sequences are identical (i.e., on a nucleotide- 
by-nucleotide basis) over the window of comparison. The term 
"percentage of sequence identity" is calculated by comparing 

35 two optimally aligned sequences over the window of comparison, 
determining the niimber of positions at which the identical 
nucleic acid base (e.g., A, T, C, G, U, or I) occurs in both 
sequences to yield the nvimber of matched positions, dividing 
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the number of matched positions by the t tal number of 
positions in the window of comparision (i.e., the window size), 
and multiplying the result by 100 t yield the percentage of 
sequence identity. The term "substantial identity" may be used 
5 to denote a characteristic of a polynucleotide sequence, 
wherein the polynucleotide comprises a sequence that has at 
least 80 percent sequence identity compared to a reference 
sequence over a comparison window of at least 18 nucleotide 
positions, wherein the percentage of sequence identity is 
10 calculated by comparing the reference sequence to the 
polynucleotide sequence which may include deletions or 
additions which total 20 percent or less of the reference, 
sequence over the window of comparison. 

The terms "substantially corresponds to" or 
15 "substantial identity'' as used herein denotes a characteristic 
of a nucleic acid sequence, wherein a nucleic acid sequence has 
at least about 70 percent sequence identity as compared to a 
reference sequence, typically at least about 85 percent 
sequence identity, and preferably at least about 95 percent to 
20 97 percent sequence identity as compared to a reference 
sequence, frequently having at least 99 percent sequence 
identity. The percentage of sequence identity is calculated 
excluding small deletions or additions which total less than 20 
percent of the reference sequence. The reference sequence may 
25 be a subset of a larger sequence, such as a portion of a gene 
or flanking sequence, or a repetitive portion of a chromosome. 
However, the reference sequence is at least 18 nucleotides 
long, typically at least about 30 nucleotides long, and 
preferably at least about 50 to 100 nucleotides long. 
30 "Svibstantially complementary" as used herein refers to a 

sequence that is complementary to a sequence that substantially 
corresponds to a reference sequence. In general, targeting 
efficiency increases with the length of the targeting 
polynucleotide portion that is substantially complementary to a 
35 reference sequence present in the target DNA. 

"Specific hybridization" is defined herein as the 
formation of hybrids between a targeting polynucleotide (e.g., 
a polynucleotide of the invention which may include 
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substitutions, deletion, and/or additions as compared to the 
pr determined target DNA sequence) and a predetermined target 
DNA, wherein the targeting polynucleotid preferentially 
hybridizes to the predetermined target DNA such that, for 
5 example, a discrete band can be identified on a Southern blot 
of DNA prepared from eukaryotic cells that contain the target 
' DNA sequence, and/or a targeting polynucleotide in an intact 
nucleus localizes to a discrete chromosomal location 
characteristic of a unique or repetitive sequence. In some 
10 instances, a target sequence may be present in more than one 
target polynucleotide species (e.g., a particular target 
sequence may occur in multiple members of a gene family or in a 
known repetitive sequence) . It is evident that optimal 
hybridization conditions will vary depending upon the sequence 
15 ' composition and length (s) of the targeting polynucleotide (s) 
and target (s) , and the experimental method selected by the 
practitioner. Various guidelines may be used to select 
appropriate hybridization conditions (seg, Maniatis et al., 
woleeular Cloning! A T.aboratorv Manual (1989), 2nd Ed., Cold 
20 spring Harbor, N.Y. and Berger and Kimmel, Met:^<?4s in 

Fngivmoloav. VoTuwe 152. PiiHde to Mnlftcular Cloning TechnjcmeS 
(1987), Academic Press, Inc., San Diego, OA., which are 
incorporated herein by reference. Methods for hybridizing a 
targeting polynucleotide to a discrete chromosomal location in 
25 intact nuclei are provided herein in the Detailed Description. 

The term "naturally-occurring" as used herein as 
applied to an object refers to the fact that an object can be 
found in nature. For example, a polynucleotide sequence that 
is present in an organism (including viruses) that can be 
30 isolated from a soxirce in nature and which has not been 

intentionally modified by man in the laboratory is naturally- 
occurring. 

A metabolically-active cell is a cell, comprising an 
intact nucleus, which, when provided nutrients and incubated in 
35 an appropriate medivun carries out DNA synthesis and RNA for 
extended periods (e.g., at least 12-24 hours). Such 
metabolically-active cells are typically differentiated cells 
incapable of further cell division (although nuclear division 
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and chromosomal replication may occur) , although stem cells are 
also metabolically-active cells. 

As used herein, the term "disease allele" refers to 
an allele of a gene which is capable of producing a 
5 recognizable disease. A disease allele may be dominant or 
recessive and may produce disease directly or when present in 
combination with a specific genetic background or pre-existing 
pathological condition. A disease allele may be present in the 
gene pool or may be generated de novo in an individual by 

10 somatic mutation. For example and not limitation, disease 
alleles include: activated oncogenes, a sickle cell anemia 
allele, a Tay-Sachs allele, a cystic fibrosis allele, a 
multidrug resistance (MDR-1) allele, a Gaucher 's Type I allele,- 
a Lesch-Nyhan allele, a retinoblastoma-susceptibility allele, a 

15 Fabry *s disease allele, and a Huntington's chorea allele. As 
used herein, a disease allele encompasses both alleles 
associated with human diseases and alleles associated with 
recognized veterinary diseases. For example, the AF508 CFTR 
allele is a human disease allele which is associted with cystic 

20 fibrosis. 

As used herein, the term "cell-uptake component" 
refers to an agent which, when bound, either directly or 
indirectly, to a targeting polynucleotide, enhances the 
intracellular uptake of the targeting polynucleotide into at 

25 least one cell type (e.g., hepatocytes) . A cell-uptake 

component may include, but is not limited to, the following: a 
galactose-terminal (asialo-) glycoprotein capable of being 
internalized into hepatocytes via a hepatocyte 
asialoglycoprotein receptor, a polycation (e.g., poly-L- 

30 lysine) , and/or a protein-lipid complex formed with the 

targeting polynucleotide. A cell uptake component may include 
a viral capsid protein, viral coat protein, or viral envelope 
protein. Various combinations of the above, as well as 
alternative cell-uptake components will be apparent to those of 

35 skill in the art and are provided in the published literature. 

DETAILED DESCRIPTION 

Generally, the nomenclatxire used hereafter and the 
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laborat ry pr cedures in cell culture, m lecular genetics, and 
nucleic acid chemistry and hybridization described below are 
those well known and c mmonly employed in the art. Standard 
techniques are used for recombinant nucleic acid methods, 
5 polynucleotide synthesis, cell culture, and transgenesis. 
Generally enzymatic reactions, oligonucleotide synthesis, 
oligonucleotide modification, and purification steps are 
performed according to the manufacturer's specifications. The 
techniques and procedures are generally performed according to 

10 conventional methods in the art and various general references 
which are provided throughout this document. The procedures 
therein are believed to be well known in the art and are 
provided for the convenience of the reader. All the 
information contained therein is incorporated herein by 

15 reference. 

Transgenic mice are derived according to Hogan, et 
al., "Manipulating the Mouse Embryo: A Laboratory Manual", 
Cold Spring Harbor Laboratory (1988) which is incorporated 
herein by reference. 

20 Embryonic stem cells are manipulated according to 

published procedures (Teratocarcinomas and embryonic stem 
cells: a practical approach, E.J. Robertson, ed., IRL Press, 
Washington, D.C., 1987; Zjilstra et al., Nature 342:435-438 
(1989); and Schwartzberg et al.. Science 246:799-803 (1989), 

25 each of which is incorporated herein by reference) . 

Oligonucleotides can be synthesized on an Applied Bio 
Systems oligonucleotide synthesizer according to specifications 
provided by the manufacturer. 

30 Targeting Polynucleotides 

Targeting polynucleotides may be produced by chemical 
synthesis of oligonucleotides, nick-translation of a doxible- 
stranded DNA template, polymerase chain-reaction amplification 
of a sequence (or ligase chain reaction eunplif ication) , 

35 piirif ication of prokaryotic or eukaryotic cloning vectors 
harboring a sequence of interest (e.g., a cloned cDNA or 
genomic clone,' or portion thereof) such as plasmids, phagemids, 
YACs, cosmids, bacteriophage DNA, other viral DNA or 



i 
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replication intennediates , or purified restriction fragments 
thereof, as well as other sources of single and double-stranded 
polynucleotides having a desired nucleotide sequence. 
Targeting polynucleotides are generally ssDNA or dsDNA, most 

preferably dsDNA. 

Targeting polynucleotides are generally at least 
about 50 to 100 nucleotides long, preferably at least about 250 
to 500 nucleotides long, more preferably at least about 1000 to 
2000 nucleotides long, or longer; however, as the length of a 
targeting polynucleotide increases beyond about 20,000 to 
50,000 nucleotides, the efficiency of transferring an intact 
targeting polynucleotide into the cell decreases. The length 
of homology may be selected at the discretion of the 
practitioner on the basis of the sequence composition and 
complexity of the predetermined endogenous target DNA 
sequence (s) and guidance provided in the art, which generally 
indicates that 1.3 to 6.8 kilobase segments of homology are 
preferred (Hasty et al. (1991) T^olep. Ce^l- pjoj,. 11: 5586; 
Shulman et al. (1990) t*""!^^- Biol- Ifi' 4*66, which are 

incorporated herein by reference) . Targeting polynucleotides 
have at least one sequence that substantially corresponds to, 
or is substantially complementary to, a predetermined 
endogenous DNA sequence (i.e., a DNA sequence of a 
polynucleotide located in a exikaryotic cell, such as a 
chromosomal, mitochondrial, chloroplast, viral, episomal, or 
mycoplasmal polynucleotide) . Such targeting polynucleotide 
sequences serve as templates for homologous pairing with the 
predetermined endogenous sequence (s), and are also referred to 
herein as homology clamps. In targeting polynucleotides, such 
homology clamps are typically located at or near the 5' or 3« 
end, preferably homology clamps are located at each end of the 
polynucleotide (Berinstein et al. (1992) N9^ee. pell, PiPlt 11'- 
360, which is incorporated herein by reference). Without 
wishing to be bound by any particular theory, it is believed 
that the addition of recombinases permits efficient gene 
targeting with targeting polynucleotides having short (i.e., 
about 50 to 500 basepair long) segments of homology, as well as 
with targeting polynucleotides having longer segments of 
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The format! n of heteroduplex joints is n t a 
stringent process; genetic evidence supports the view that the 
classical phenomena of meiotic gene conversion and aberrant 
5 meiotic segregation result in part from the inclusion of 
mismatched base pairs in heteroduplex joints^ and the 
subsequent correction of some of these mismatched base pairs 
before replication. Observations on recA protein have provided 
information on parameters that affect the discrimination of 

10 relatedness from perfect or near-perfect homology and that 

affect the inclusion of mismatched base pairs in heteroduplex 
joints. The ability of recA protein to drive strand exchange 
past all single base-pair mismatches and to form extensively 
mismatched joints in superhelical DNA reflect its role in 

15 recombination and gene conversion. This error-prone process 

may also be related to its role in mutagenesis. RecA-mediated 
pairing reactions involving DNA of ^X174 and G4, which are 
about 70 percent homologous, have yielded homologous 
recombinants (Cunningham et al. (1981) Cell 24; 213) r although 

20 recA preferentially forms homologous joints between highly 

homologous sequences, and is implicated as mediating a homology 
seenrch process between an invading DNA strand and a recipient 
DNA strand, producing relatively stable heteroduplexes at 
regions of high homology. 

25 Therefore, is is preferred that targeting 

polynucleotides of the invention have homology clamps that are 
highly homologous to the predetermined target endogenous DNA 
sequence (s), preferably not containing terminally located 
mismatches, and most preferably isogenic. Typically, targeting 

30 polynucleotides of the invention have at least one homology 

clamp that is at least about 25 to 35 nucleotides long, and it 
is preferable that homology clamps are at least about 50 to 100 
nucleotides long, and more preferably at least about 100-500 
nucleotides long, although the degree of sequence homology 

35 between the homology clamp and the targeted sequence and the 
base composition of the targeted sequence will determine the 
optimal and minimal clamp lengths (e.g., G-C rich sequences are 
typically more thermodynamically stable and will generally 
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require short r clamp length) . Therefore, both homology clamp 
length and the degree of sequence homology can only be 
determined with reference to a particular predetermined 
sequence, but homology clamps generally must be about at least 
5 25-50 nucleotides long (Rubnitz and Subramani (1985) Molt g^U- 
Biol. S: 659; Ayares et al. (1986) ?rog, V^tl, ftcadt Sgii 
m.s.A.> £2: 5199; Campbell et al. (1989) T^ft New p^olpgist: 1: 
223) and must also substantially correspond or be substantially 
complementary to a predetermined target sequence. Preferably, 
0 a homology clamp is at least about 50 nucleotides long and is 
identical to or complementary to a predetermined target 
sequence. Without wishing to be bound by a particular theory, 
it is believed that the addition of recombinases to a targeting 
polynucleotide enhances the efficiency of homologous 
5 recombination between homologous, nonisogenic sequences (e.g., 
between an exon 2 sequence of a albumin gene of a Balb/c mouse 
and a homologous albumin gene exon 2 sequence of a C57/BL6 
mouse) , as well as between isogenic sequences. 

The invention is preferably practiced with a 
0 complementary pair of targeting polynucleotides, usually of 
equal length, which are simultaneously or contemporaneously 
introduced into a exikaryotic cell harboring a predetermined 
endogenous target sequence, generally with at least one 
recombinase protein (e.g., recA) . Under most circumstances, it 
25 is preferred that the targeting polynucleotides are incubated 
with recA or other recombinase prior to introduction into a 
eukaryotic cell, so that the recombinase protein (s) may be 
"loaded" onto the targeting polynucleotide (s) . Incubation 
conditions for such recombinase loading are described infra, 
30 and also in U.S. S.N. 07/755,462, filed 4 September 1991; 
U.S. S.N. 07/910,791, filed 9 July 1992; and U.S. S.N. 
07/520,321, filed 7 May 1990, each of which is incorporated 
herein by reference. A targeting polynucleotide may contain a 
sequence that enhances the loading process of a recombinase, 
35 for example a recA loading sequence is the recombinogenic 

nucleation sequence poly-[d(A-C) ] , and its complement, poly- 
[d(G-T)]. The duplex sequence poly[d(A-C) d(G-T)]n, where n is 
from 5 to 25, is a middle repetitive element in eukaryotic DNA 
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The invention may also be practiced with individual 
targeting polynucleotides which do not comprise part of a 
complementary pair. In each case, a targeting polynucleotide 
is introduced into a eukaryotic cell simultaneously or 
contemporaneously with a recombinase protein, typically in the 
form of a coated targeting polynucleotide (i.e., a 
polynucleotide preincubated with recombinase wherein the 
recombinase is noncovalently bound to the polynucleotide) . 

A targeting polynucleotide used in a method of the 
invention typically is a single-stranded nucleic acid, usually 
a DNA strand, or derived by denaturation of a duplex DNA, which 
is complementary to one (or both) strand (s) of the target 
duplex nucleic acid. The homology clamp sequence preferably 
contains at least 90-95% sequence homology with the target 
sequence, to insure sequence-specific targeting of the 
targeting polynucleotide to the endogenous DNA target. 
Usually, it is preferable that the homology clamps are isogenic 
with the target polynucleotide. The single-stranded targeting 
polynucleotide is typically about 50-600 bases long, although a 
shorter or longer polynucleotide may also be employed. 
Alternatively, the targeting polynucleotide may be prepared in 
single-stranded form by oligonucleotide synthesis methods, 
which may first require, especially with larger targeting 
polynucleotides, formation of subfragments of the targeting 
polynucleotide, typically followed by splicing of the 
subfragments together, typically by enzymatic ligation. 

Recombin ase Proteins 

Recombinases are proteins that, when included with an 
exogenous targeting polynucleotide, provide a measurable 
increase in the recombination frequency and/ or localization 
frequency between the targeting polynucleotide and an 
endogenous predetermined DNA sequence. In the present 
invention, recombinase refers to a family of RecA-like 
recombination proteins all having essentially all or most of 
the same fvmctions, particularly: (i) th recombinas 
protein's ability to properly bind to and position targeting 
polynucleotides on their homologous targets and (ii) the 
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ability of recombinase pr tein/targeting polynucleotide 
complexes to efficiently find and bind t complementary 
end genous sequences. The best characterized recA protein is 
from E. coli, in addition to the wild-type protein a number of 
5 mutant recA-like proteins have been identified (e.g., recA803) . 
Further, many organisms have recA-like recombinases with 
strand-transfer activities (e.g., Fugisawa et al. , (1985) Nucl. 
Acids Res. 13: 7473; Hsieh et al., (1986) Cell 44: 885; Hsieh 
et al., (1989) .T. Biol. Chem. 264: 5089; Fishel et al., (1988) 
10 pr-oe. Natl- Acad. Sei. USA 85; 3683; Cassuto et al. , (1987) 

Mnl. Gen. Genet. 208; 10; Ganea et al. , (1987) MoT- Cell Biol. 
2: 3124; Moore et al., (1990) .t. Biol. Chem. i9: 11108; Keene 
et'al., (1984) Kuel. Acids Res. 12; 3057; Kimiec, (1984) Csld 
RpT-ina Harbor Svmo. ^lS15', Kimeic, (1986) £sli MJ 545; 
15 Kolodner et al., (1987) Proc. Natl- Acad. Sci. USA 84 :5560; 
Sugino et al., (1985) Proe. Natl . Acad. Sei. USA fiS; 3683; 
Halbrook et al., (1989) J. Biol. Chem. ZM.'- 21403; Eisen et 
al., (1988) Proc. Natl. Acad. Sci. USA 85; 7481; McCarthy et 
al., (1988) Proc. Natl. Acad. Sci. USA 85; 5854; Lowenhaupt et 
20 al., (1989) J- Biol. Chem. 264 ; 20568, which are inisorporated 
herein by reference. Examples of such recombinase proteins 
include, for example but not limitation: recA, recA803, uvsX, 
and other recA mutants and recA-like recombinases (Roca, A.I. 
(1990) grit. Rev. Biochem. Molec. Biol. 25; 415) , SSEl 
25 (Kolodner et al. (1987) Proc. Nat3 . Acad. Sci. fU.S.A.) £45 
5560; Tishkoff et al. Molec. Cell. Biol. 11; 2593), Bav£ 
(Dunderdale et al. (1991) Nature 354: 506), DST2, KEMl, XRNl 
(Dykstra et al. (1991) MoIac. Cell. Biol. H: 2583) , STPo/DSTl 
(Clark et al. (1991) Molec. Cell. Biol. H; 2576) , HPP-1 (Moore 
30 et al. (1991) Proc. Natl. Acad. Sci. fU.S.A.^ 88: 9067) , other 
eukaryotic recombinases (Bishop et al. (1992) Cell 6i: 439; 
Shinohara et al. (1992) Cell 69: 457); incorporated herein by 
reference. RecA may be purified from E. coli strains, such as 
E. coli strains JC12772 and JC15369 (available from A.J. Clark 
35 and M. Madiraju, University of California-Berkeley) . These 
strains contain the recA coding sequences on a "runaway" 
replicating plasmid vector present at a high copy numbers per 
cell. The recA803 protein is a high-activity mutant f wild- 
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type recA. The art teaches several examples of rec mbinase 
proteins, for example, from Drosophila, yeast, plant, human, 
and non-human mammalian cells, including pr teins with 
biological properties similar to recA (i.e., recA-like 
5 recombinases) • 

Recombinase protein (s) (prokaryotic or eukaryotic) 
may be exogenously administered to a eukaryotic cell 
simultaneously or contemporaneously (i.e., within about a few 
hours) with the targeting polynucleotide (s) . Such 

10 administration is typically done by microinjection, although 
electroporation, lipofection, and other transfection methods 
known in the art may also be used. Alternatively, recombinase 
proteins may be produced jji vivo from a heterologous expression 
cassette in a transfected cell or transgenic cell, such as a 

IS transgenic totipotent embryonal stem cell (e.g., a murine ES 
cell such as AB-1) used to generate a transgenic non-human 
animal line or a pluripotent hematopoietic stem cell for 
reconstituting all or part of the hematopoietic stem cell 
population of an individual, or a terminally differentiated 

20 cell type, such as a fibroblast, epithelial cell (e.g., 

pneumocyte) , or muscle cell. Conveniently, a heterologous 
expression cassette includes a modulatable promoter, such as an 
ecdysone-inducible promoter-enhancer combination, an estrogen- 
induced promoter-enhancer combination, a CMV promoter-enhancer, 

25 an insulin gene promoter, or other cell-type specific, 

developmental stage-specific, hormone- inducible, or other 
modulatable promoter construct so that expression of at least 
one species of recombinase protein from the cassette can by 
modulated for transiently producing recombinase (s) in vivo 

30 simultaneous or contemporaneous with introduction of a 
targeting polynucleotide into the cell. When a hormone- 
inducible promoter-enhancer combination is used, the cell must 
have the required hormone receptor present, either naturally or 
as a consequence of expression a co-transfected expression 

35 vector encoding such receptor. 

For making transgenic non-h\iman animals (which 
include homolbgously targeted non-human animals) embryonal stem 
cells (ES cells) are preferred. Murine ES cells, such as AB-1 
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line grown on mitotically inactive SNL76/7 cell feeder lay rs 
(McMahon and Bradley, Cell 1073-1085 (1990)) essentially as 
described (R bertson, E.J. (1987) in Teratocarcinomas and 
pmbrvonic Stem Cells: A Practical Approach . E.J. Robertson, ed. 
5 (Oxford: IRL Press) , p. 71-112) may be used for homologous gene 
targeting. Other suitable ES lines include, but are not 
limited to, the E14 line (Hooper et al. (1987) Nature 326 ; 292- 
295) , the D3 line (Doetschman et al. (1985) J> Embrvol. Exp. 
Morph. 87: 27-45), and the CCE line (Robertson et al. (1986) 

10 Mature 323 ; 445-448) . The success of generating a mouse line 
from ES cells bearing a specific targeted mutation depends on 
the pluripotence of the ES cells (i.e., their ability, once 
injected into a host blastocyst, to participate in 
embryogenesis and contribute to the germ cells of the resulting 

15 animal) . 

The pluripotence of any given ES cell line can vary 
with time in culture and the care with which it has been 
handled. The only definitive assay for pluripotence is to 
determine whether the specific population of ES cells to be 

20 used for targeting can give rise to chimeras capable of 

germline transmission of the ES genome. For this reason, prior 
to gene targeting, a portion of the parental population of AB-1 
cells is injected into C57B1/6J blastocysts to ascertain 
whether the cells are capable of generating chimeric mice with 

25 extensive ES cell contribution and whether the majority of 
these chimeras can transmit the ES genome to progeny. 

The vectors containing the DNA segments of interest 
can be transferred into the host cell by well-)cnown methods, 
depending on the type of cellular host. For example, 

30 microinjection is commonly utilized for eukaryotic cells, 
although calcium phosphate treatment, electroporation, 
lipofection, biolistics or viral-based transfection also may be 
used. Other methods used to transform mammalian cells include 
the use of Polybrene, protoplast fusion, and others (see, 

35 generally . Sambrook et al. Molecular Cloning: A Laboratory 

Manual, 2d ed., 1989, Cold Spring Harbor Laboratory Press, Cold 
Spring Harbor/ N.Y., which is incorporated herein by 
reference) . Direct injection of DNA and/or recombinase-coated 
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targeting polynucleotides int target cells, such as skeletal 
or muscle cells also may be used (Wolff et al. (1990) Science 
247 ; 1465, which is incorp rated herein by reference). 

RecA protein is typically obtained from bacterial 
5 strains that overproduce the protein: wild-type E. coli recA 
protein and mutant recA803 protein may be purified from such 
strains. Alternatively, recA protein can also be pxirchased 
from, for example, Pharmacia (Piscataway, NJ) . 

RecA protein forms a nucleoprotein filament when it 

10 coats a single-stranded DNA. In this nucleoprotein filament, 
one monomer of recA protein is bound to about 3 nucleotides. 
This property of recA to coat single-stranded DNA is 
essentially sequence independent, although particular sequences 
favor initial loading of recA onto a polynucleotide (e.g., 

15 nucleation sequences). The nucleoprotein filament (s) can be 
formed on essentially any DNA molecule and can be formed in 
cells (e.g., mammalian cells), forming complexes with both 
single-stranded and double-stranded DNA. 

20 Recombinase Coating of Targeting Polynucl eotides 

The conditions used to coat targeting' polynucleotides 
with recA protein and ATP7S have been described in commonly 
assigned U.S. S.N. 07/910,791, filed 9 July 1992; U.S. S.N. 
07/755,462, filed 4 September 1991; and U.S. S.N. 07/520,321, 

25 filed 7 May 1990, each incorporated herein by reference. 

Targeting polynucleotides can be coated using GTP7S, mixes of 
ATP7S with rATP and/ or dATP, or dATP or rATP alone or in the 
presence of an rATP generating system (Boehringer Mannheim) . 
Various mixtxires of GTP7S, ATP7S, ATP, ADP, dATP and/or rATP 

30 may be used, particularly preferred are mixes of ATP7S and ATP 
or ATP7S and ADP. 

RecA protein coating of targeting polynucleotides is 
typically carried out as described in W091/17267 and 
WO93/05178. Briefly, the targeting polynucleotide, whether 

35 dotible-stranded or single-stranded, is denatured by heating in 
an aqueous solution at 95-100'*C for five minutes, then placed 
in an ice bath for 20 seconds to ab ut one minut followed by 
centrifugation at 0*C for approximately 20 sec, before use. 
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When denatvired targeting polynucle tides are not placed in a 
freezer at -20'C they are usually immediately added t standard 
recA coating reaction buffer containing ATP7S, at room 
temperature, and to this is added the recA protein. 
Alternatively, recA protein may be included with the buffer 
components and ATP7S before the polynucleotides are added. 

RecA coating of targeting polynucleotide (s) is 
initiated by incubating polynucleotide-recA mixtures at 37 
for 10-15 min. RecA protein concentration tested during 
reaction with polynucleotide varies depending upon 
polynucleotide size and the amount of added polynucleotide, and 
the ratio of recA molecule: nucleotide preferably ranges between 
about 3:1 and 1:3. When single-stranded polynucleotides are 
recA coated independently of their homologous polynucleotide 
strands, the mM and fM concentrations of ATP7S and recA, 
respectively, can be reduced to one-half those used with 
double-stranded targeting polynucleotides (i.e. recA and ATP7S 
concentration ratios are usually kept constant at a specific 
concentration of individual polynucleotide strand, depending on 
whether a single- or double-stranded polynucleotide is used) . 

RecA protein coating of targeting polynucleotides is 
normally carried out in a standard IX RecA coating reaction 
buffer. lOX RecA reaction buffer (i.e., lOx AC buffer) consists 
of: 100 mM Tris acetate (pH 7.5 at 37«C) , 20 mM magnesium 
acetate, 500 mM sodivun acetate, 10 mM DTT, and 50% glycerol) . 
All of the targeting polynucleotides, whether double-stranded 
or single-stranded, typically are denatvired before use by 
heating to 95-100 "C for five minutes, placed on ice for up to 
one minute, and subjected to centrifugation (10,000 rpm) at 0»C 
for approximately 20 seconds (e.g., in a Tomy centrifuge). 
Denatvired targeting polynucleotides usually are added 
immediately to room temperature RecA coating reaction buffer 
mixed with ATP7S and diluted with double-distilled H2O, as 
necessary. 

A reaction mixture typically contains the following 
components: (i) 2.4 mM ATP7S; and (ii) between 1-100 ng/nl of 
targeting polynucleotide. To this mixture is added about 1-20 
Hi of recA protein per 10-100 /il of reaction mixture, usually 
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at about 5. 2-11 ,0 mg/ml (purchased from Pharmacia or purified), 
and is rapidly added and mixed. The final reacti n volxime for 
RecA c ating of targeting polynucleotide is usually in the 
range of about 10-500 Ml- RecA coating of targeting 
5 polynucleotide is usually initiated by incubating targeting 
polynucleotide-RecA mixtxires at 37«»C for about 10-15 min. 

RecA protein concentrations in coating reactions 
varies depending upon targeting polynucleotide size and the 
amount of added targeting polynucleotide: recA protein 

10 concentrations are typically in the range of 5 to 50 fM. When 
single-stranded targeting polynucleotides are coated with recA, 
independently of their complementary strands, the 
concentrations of ATP7S and recA protein may optionally be 
reduced to about one-half of the concentrations used with 

15 double-stranded targeting polynucleotides of the same length: 
that is, the recA protein and ATP7S concentration ratios are 
generally kept constant for a given concentration of individual 
polynucleotide strands. 

The coating of targeting polynucleotides with recA 

20 protein can be evaluated in a nxmber of ways. First, protein 
binding to DNA can be examined using band-shift gel assays 
(McEntee et al., (1981) J. Biol. Chem. 256 :8835). Labeled 
polynucleotides can be coated with recA protein in the presence 
of ATP7S and the products of the coating reactions may be 

25 separated by agarose gel electrophoresis. Following incubation 
of recA protein with denatured duplex DNAs the recA protein 
effectively coats single-stranded targeting polynucleotides 
derived from denaturing a duplex DNA. As the ratio of recA 
protein monomers to nucleotides in the targeting polynucleotide 

30 increases from 0, 1:27, 1:2.7 to 3.7:1 for 121-mer and 0, 1:22, 
1:2.2 to 4.5:1 for 159-mer, targeting polynucleotide's 
electrophoretic mobility decreases, i.e., is retarded, due to 
recA-binding to the targeting polynucleotide. Retardation of 
the coated polynucleotide's mobility reflects the saturation of 

35 targeting polynucleotide with recA protein. An excess of recA 
monomers to DNA nucleotides is required for efficient recA 
coating of short targeting polynucleotides (Leahy et al., 
(1986) J. Biol. Chem. 261 :6954). 
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A second method for evaluating protein binding to DNA 
is in the use of nitrocellulose fiber binding assays (Leahy et 
al., (1986) J- Biol, Chem. 261 ;6954; Woodbury^ et al., (1983) 
Biochemistry 22 f 20) :4730-4737, The nitrocellulose filter 
5 binding method is particularly useful in determining the 

dissociation-rates for protein: DNA conplexes using labeled DNA, 
In the filter binding assay, DNA: protein complexes are retained 
on a filter while free DNA passes through the filter. This 
assay method is more quantitative for dissociation-rate 
10 determinations because the separation of DNA: protein complexes 
from free targeting polynucleotide is very rapid. 

Cell-Uptake Components 

A targeting polynucleotide of the invention may 

15 optionally be conjugated, typically by noncovalent binding, to 
a cell-uptake component. Various methods have been described 
in the art for targeting DNA to specific cell types. A 
targeting polynucleotide of the invention can be conjugated to 
essentially any of several cell-uptake components known in the 

20 eort^ For targeting to hepatocytes, a targeting polynucleotide 
can be conjugated to an asialoorosomucoid (ASOR) -poly-L- lysine 
conjugate by methods described in the art and incorporated 
herein by reference (Wu GY and Wu CH (1987) J- Biol> Chem, 262.: 
4429; Wu GY and Wu CH (1988) Biochemistry 27: 887; Wu GY and Wu 

25 CH (1988) J. Biol. Chem. 263 : 14621; Wu GY and Wu CH (1992) 3^ 
Biol. Chem. 267 : 12436; Wu et al. (1991) J. Biol. Chem. 216: 
14338; and Wilson et al. (1992) J. Biol. Chem. 217: 963, 
WO92/06180; W092/05250; and W091/17761 which are incorporated 
herein by reference) . 

30 Alternatively, a cell-uptake component may be formed 

by incubating the targeting polynucleotide with at least one 
lipid species and at least one protein species to form protein- 
lipid-polynucleotide complexes consisting essentially of the 
targeting polynucleotide and the lipid-protein cell-uptake 

35 component. Lipid vesicles made according to Feigner 

(W091/17424, incorporated herein by reference) and/ r cationic 
lipidization (WO91/16024, incorporated herein by reference) or 
other forms for polynucle tide administration (EP 465,529, 
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incorporated herein by reference) may als be employed as cell- 
uptake components. Viral caps id proteins, coat proteins, and 
envelope proteins (e.g., those promoting m mbrane fusion; 
hemagglutinin) . Examples of viral glycoproteins which attach 
5 to cell surface receptors and lead to internalization and/ or 
membrane fusion include the gB, gC, gD, gE, gH, and gl virion 
glycoproteins of HSV-1, and gpl20 of HIV-1. 

Typically, a targeting polynucleotide of the 
invention is coated with at least one recombinase and is 

10 conjugated to a cell-uptake component, and the resulting cell 
targeting complex is contacted with a target cell under uptake 
conditions (e.g., physiological conditions) so that the 
targeting polynucleotide and the recombinase (s) are 
internalized in the target cell. A targeting polynucleotide 

15 " may be contacted simultaneously or sequentially with a cell- 
uptake component and also with a recombinase; preferably the 
targeting polynucleotide is contacted first with a recombinase, 
or with a mixture comprising both a cell-uptake component and a 
recombinase under conditions whereby, on average, at least 

20 about one molecule of recombinase is noncovalently attached per 
targeting polynucleotide molecule and at least about one cell- 
uptake component also is noncovalently attached. Most 
preferably, coating of both recombinase and cell-uptake 
component saturates essentially all of the available binding 

25 sites on the targeting polynucleotide. A targeting 

polynucleotide may be preferentially coated with a cell-uptake 
component so that the resultant targeting complex comprises, on 
a molar basis, more cell-uptake component than recombinase ( s) . 
Alternatively, a targeting polynucleotide may be preferentially 

30 coated with recombinase (s) so that the resultant targeting 

complex comprises, on a molar basis, more recombinase (s) than 
cell-uptake component. 

Cell-uptake components are included with recombinase- 
coated targeting polynucleotides of the invention to enhance 

35 the uptake of the recombinase-coated targeting 

polynucleotide (s) into cells, particularly for in vivo gene 
targeting applications, such as gene therapy to treat gen tic 
diseases, including neoplasia, and targ ted homologous 
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recombination t treat viral infections wherein a viral 
sequence (e.g-, an integrated hepatitis B virus (HBV) gen me or 
genome fragment) may be targeted by homologous sequence 
targeting and inactivated. Alternatively, a targeting 
5 polynucleotide may be coated with the cell-uptake component and 
targeted to cells with a contemporaneous or simultaneous 
administration of a recombinase (e.g., liposomes or 
immixnoliposomes containing a recombinase, a viral-based vector 
encoding and expressing a recombinase) . 

IQ Several disease states may be amenable to treatment 

or prophylaxis by targeted alteration of heptocytes In vivo by 
homologous gene targeting. For example and not for limitation, 
the following diseases, among others not listed, are expected 
to be amenable to targeted gene therapy: hepatocellular 

15 carcinoma, HBV infection, familial hypercholesterolemia (LDL 
receptor defect), alcohol sensitivity (alcohol dehydrogenase 
and/or aldehyde dehydrogenase insufficiency) , hepatoblastoma, 
Wilson's disease, congenital hepatic porphyrias, and inherited 
disorders of hepatic metabolism. Where targeting of hepatic 

20 cells in vivo is desired, a cell-uptake component consisting 

essentially of an asialoglycoprotein-poly-L- lysine conjugate is 
preferred. The targeting complexes of the invention which may 
be used to target hepatocytes in vivo ^ake advantage of the 
significantly increased targeting efficiency produced by 

25 association of a targeting polynucleotide with a recombinase 
which, when combined with a cell-targeting method such as that 
of W092/05250 and/or Wilson et al. (1992) J- Biol. Chem. 267: 
963, provide a highly efficient method for performing in vivo 
homologous sequence targeting in cells, such as hepatocytes. 

30 For many types of in vivo gene therapy to be 

effective, a significant number of cells must be correctly 
targeted, with a minimum number of cells having an incorrectly 
targeted recombination event. To accomplish this objective, 
the combination of: (1) a targeting polynucleotide (s) , (2) a 
35 recombinase (to provide enhanced efficiency and specificity of 
correct homologous sequence targeting) , and (3) a cell-uptake 
component (to' provide enhanced cellular uptake of the targeting 
poynucleotide) , provides a means for the efficient and specific 
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targeting of cells In vivo * making in vivo homologous sequence 
targeting, and gene therapy, practicable. 

Targeting of Endogenous DNA Sequence s In Vivo 
5 Generally, any predetermined endogenous DNA sequence 

can be altered by homologous recombination (which includes gene 
conversion) with an exogenous targeting polynucleotide (or 
complementary pair of targeting polynucleotides) that has at 
least one homology clamp which substantially corresponds to or 

10 is substantially complementary to a predetermined endogenous 
DNA target sequence and which is introduced with a recombinase 
(e.g., recA) into a eukaryotic cell having the predetermined 
endogenous DNA sequence. Typically, a targeting polynucleotide 
(or complementary polynucleotide pair) has a portion having a 

15 sequence that is not present in the preselected endogenous 

targeted sequence (s) (i.e., a nonhomologous portion) which may 
be as small as a single mismatched nucleotide or may span up to 
about several kilobases or more of nonhomologous sequence. 
Generally, such nonhomologous portions are flanked on each side 

20 by homology cleunps, although a single flanking homology clamp 
may be used. Nonhomologous portions are used to meJce 
insertions, deletions, and/ or replacements in a predetermined 
endogenous targeted DNA sequence, and/or to meike single or 
multiple nucleotide substitutions in a predetermined endogenous 

25 target DNA sequence so that the resultant recombined sequence 

(i.e., a targeted recombinant endogenous sequence) incorporates 
some or all of the sequence information of the nonhomologous 
portion of the targeting polynucleotide (s) . Additions and 
deletions may be as small as 1 nucleotide or may range up to 

30 about 2 to 10 kilobases or more. 

In one application, a targeting polynucleotide can be 
used to repair a mutated sequence of a structxiral gene by 
replacing it or converting it to a wild-type sequence (e.g., a 
sequence encoding a protein with a wild-type biological 

35 activity) . For example, such applications could be used to 
convert a sickle cell trait allele of a hemoglobin gene to an 
allele which encodes a hemoglobin molecule that is not 
susceptible to sickling, by altering the nucle tide sequence 
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• encoding the /3-sub\init of hemoglobin so that the c don at 
position 6 of the jg subunit is converted ValiS6 — >Glui86 (Shesely 
et al, (1991) oD.cit, ) . Other genetic diseases can be 
corrected, either partially or totally, by replacing, 
5 inserting, and/ or deleting sequence information in a disease 
allele using appropriately selected exogenous targeting 
polynucleotides. For example but not for limitation, the AF508 
deletion in the human CFTR gene can be corrected by targeted 
homologous recombination employing a recA-coated targeting 
10 polynucleotide of the invention. 

Gene Inactivation 
In addition to correcting disease alleles, exogenous 
targeting polynucleotides can be used to inactivate one or more 

15 genes in a cell (or transgenic nonhiman animal) • 

Once the specific target genes to be modified are 
selected, their sequences will be scanned for possible 
disruption sites (convenient restriction sites, for example). 
Plasmids are engineered to contain an appropriately sized gene 

20 sequence with a deletion or insertion in the gene of interest 
and at least one flanking homology clamp which svibstantially 
corresponds or is substantially complementary to an endogenous 
target DNA sequence. Vectors containing a targeting 
polynucleotide sequence are typically grown in coli and then 

25 isolated using standard molecular biology methods, or may be 
synthesized as oligonucleotides. Direct targeted inactivation 
which does not require vectors may also be done. When using 
microinjection procedures it may be preferable to use a 
transfection technique with linearized sequences containing 

30 only modified target gene sequence and without vector or 

selectable sequences. The modified gene site is such that a 
homologous recombinant between the exogenous targeting 
polynucleotide and the endogenous DNA target sequence can be 
identified by using carefully chosen primers and PGR, followed 

35 by analysis to detect if PGR products specific to the desired 
targeted event are present (Erlich et al., (1991) Science 252: 
1643, which is incorporated herein by reference). Several 
studies have already used PGR to successfully identify and then 
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clone the desired transfected cell lines (Zimmer and Gruss, 
(1989) Nature 338:150; Mouellic et al., (1990) Proc. Natl> 
^ead, scl. USA 87:4712; Shesely et al., (1991) Proc. Natl> 
^ead. sci. USA 88.5 4294, Which are incorporated herein by 
5 reference). This approach is very effective when the ntimber of 
cells receiving exogenous targeting polynucleotide (s) is high 
(i.e., with microinjection, or with liposomes) and the treated 
cell populations are allowed to expand to cell groups of 
approximately 1 x 10^ cells (Capecchi, (1989) Science 
10 244 :1288) . When the target gene is not on a sex chromosome, or 
the cells are derived from a female, both alleles of a gene can 
be targeted by sequential inactivation (Mortensen et al., 
(1991) Proc. Natl, Acad- Sci. USA 88:7036). 

15 ' Homologous Pairing of Targeting Polynucleotides 

Having Chemical Substituents 
Exogenous targeting polynucleotides that have been 
modified with appended chemical substituents may be introduced 
along with recombinase (e.g., recA) into a metabolically active 

20 eukaryotic cell to homologously pair with a predetermined 
endogenous DNA target sequence in the cell. Typically such 
exogenous targeting polynucleotides are derivatized, and 
additional chemical substituents are attached, either during or 
after polynucleotide synthesis, respectively, and are thus 

25 localized to a specific endogenous target sequence where they 
produce an alteration or chemical modification to a local DNA 
sequence. Preferred attached chemical substituents include: 
cross-linking agents, psoralen, metal chelates (e.g., iron/EDTA 
chelate for iron catalyzed cleavage), topoisomerases , 

30 endonucleases , exonucleases, ligases, phosphodiesterases, 
photodynamic porphyrins, chemotherapeutic drugs (e.g., 
adriamycin, doxirubicin) , intercalating agents, base- 
modification agents, immunoglobulin chains, and 
oligonucleotides. Iron/EDTA chelates are particularly 

35 preferred chemical substituents where local cleavage of a DNA 
sequence is desired (Hertzberg et al. (1982) J- Am, Chem, SoCt 
104 ; 313; Hertzberg and Dervan (1984) Biochemistrv 23: 3934; 
Taylor et al. (1984) Tetrahedron 40 : 457; Dervan, PB (1986) 
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Ssi^ 222: 464, Which are in= rporated herein by reference) . 
preferred attachment chemistries include: ^^"f ' ^^^j ' 

via an appended reactive amino group (Corey and Schult. (1988) 
LtsnSS L= "01, Which is incorporated herein by reference) 
and other direct linkage chemistries, although 
streptavidin/biotin and digoxigenin/anti-dlgoxigenxn antibody 
linkage methods may also be used. Methods for linking 
chemical substitutents are provided in U.S. Patents 5,135,720, 
5,093,245, and 5,055,556, which are incorporated herein by 
reference. Other linkage chemistries may be used at the 
discretion of the practitioner. ^ „i»v 

The broad scope of this invention is best understood with 
reference to the following examples, which «:e not intended to 
limit the invention in any manner. 

]r nnMPI.E 1 

T.ra ->1r" ^».^A-Cnnrrd Chemj r^H Y-HP->if 

Pn^Y^»lCl «»n■^^des tt> Cells 
Homologously targeted exogenous targeting 
polynuclotides specifically target human DNA sequences in 
intact nuclei of metabolically active cells. RecA-coated 
complementary exogenous targeting polynucleotides were 
introduced into metabolically active human cells encapsulated 
in agarose microbeads and permeabilized to permit entry of 
DNA/protein complexes using the Jackson-Cook method (Cook, P.R. 
(1984) EMBO J . 3: 1837; Jackson and Cook (1985) SffiO-i^ l*- 
919; Jackson and Cook (1985) ±'' 9"; Jackson and cook 

(1986) -T- Mni. Biol. 121: 65; Jackson et al. (1988) J. Ce^t 
sci. afi: 365, which are incorporated herein by reference). 
These experiments were designed to specifically target 
homologous DNA sequences with recA protein in intact nuclei of 
metabolically active human HEp-2 cells. 

J Jackson and Cook previously demonstrated that the 

nuclear membranes of human or other cells may be permeabilxzed 
without loss of metabolic function if the cells are first 
encapsulated in a gel of agarose microbeads. The agarose 
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mlcrobead coat contains the cell c nstituents and preserves 
native confonnati n of chromososomal DNA, while permitting 
diffusion of nacromolecules into and out of the cell 
c mpartment. Wittig et al. (1991) Proc. Natl, Acad, Sci. 
5 fU.S,A, ) 88 ; 2259, which is incorporated herein by reference, 
demonstrated that monoclonal antibodies directed against 
left-handed Z-DNA could be diffused into these ag2u:ose-embedded 
cells, and that the antibodies were specifically targeted to 
chromosomal sequences and conformations. In a similar manner, 

10 we incxibated biotin- or FITC-labeled complementary DNA 

targeting polynucleotides coated with recA with agarose-coated 
cell nuclei and verified the correct homologous targeting of 
the exogenous targeting polynucleotides to specific 
predetermined human DNA sequences in cell nuclei of 

15 metabolically active cells. 

RecA-mediated homologous gene targeting with 
complementary oligonucleotides in intact human cell nuclei was 
verified directly by homologous targeting using targeting 
polynucleotides that were biotinylated. These were 

20 subsequently labeled with a fluorescent compound to verify 
homologous pairing at specific locations having the 
predetermined sequence (s) • RecA-coated targeting 
polynucleotides for human chromosome 1 pericentrometric 
alpha-satellite DNA sequences were specifically targeted to 

25 chromosome 1 centromere sequences in living human cell nuclei 
that were permeabilized and suspended in agarose. 

In these experiments, recA-coated biotinylated 
exogenous targeting polynucleotides containing homologous 
sequences to human chromosome 1 alpha satellite DNA were 

30 incubated with hximan HEp-2 cells. The cells were embedded in 
agarose, then treated with standard buffers (according to 
Jackson and Cook, o p . cit , ) to remove the cytoplasmic membrane 
and cytoplasm immediately before the addition of targeting 
polynucleotide coated with recA protein. 

35 The experiments were performed with the following 

results • 

First, in order to test protocols to be used in 
nuclear encapsulation, freshly trypsinized growing human HEp-2 
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tumor cells were suspended in complete DMEM encapsulated in a 
mixture of agarose (2.5%, Fisher-Bioteck) and complete DMEM 
media adapting the protocols f Nilsson et al., 1983, so that 
the final agarose c ncentration was 0.5% (4 volumes cells in 
suspension with 1 volume 2.5% agarose), and the final cell 
concentration range was approximately 2.4 x lO^ to 8 x lO^. 
The encapsulated cells in agarose "beads" were placed in petri 
dishes to which DMEM complete media was added and were allowed 
to grow for 24 hr in an incubator at 37°C , 7% COj. At 24 hr, 
the cells were clearly growing and multiplying and thus were 
alive and metabolically active. 

An aliquot of agarose containing cells (in beads in 
DMEM medium) was treated to remove the cytoplasmic membrane and 
cytoplasm by addition of ice-cold sterile PBS, New Buffer 
(Jackson et al. (1988) oT..elt.t 130 mM KCl, 10 mM Na2HP04, 1 mM 
MgClj, 1 mM NajATP, and 1 mM dithithreitol, pH 7.4 ), New 
Buffer with 0.5 % Triton-X 100, New Buffer with 0.2% BSA, then 
was centrifuged at low speed using protocols developed by 
Jackson and Cook, 1985 and 1986 op.c^t..; Wittig et al. (1989) 
■T, r^n. Btol. IMS 755; Wittig et al. (1991) oprCi%, ) who have 
shown that this treatment allows the nuclear membrane to remain 
morphologically intact. The nuclei are • metabolically active as 
shown by a DNA synthesis rate of 85 to 90% compared with that 
of untreated control cells. 

Cytoplasm was effectively removed by the above 
treatment, and the encapsulated nuclei were intact as 
demonstrated by their morphology and exclusion of 0.4% trypan 
blue. Nuclei in agarose were returned to the humidified COj 
incubator at 37-C for 24 hr and remained metabolically active, 
we observed that filtered, sterile mineral oil used in the 
emulsification process was difficult to remove entirely and 
interfered with the microscopic visualization of suspended 
nuclei. Therefore, the cell-agarose suspension process was 
simplified. In subsequent experiments cells were gently 
vortexed with melted (39»C) agarose, then the agarose-cell 
mixture was st rilely minced before New Buffer treatments. 
This simpler process, eliminating the oil step, makes it easier 
to visualize the cells and chromes mes at the completion of 
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reactions • 

After mincing of the agar and N w Buffer treatments 
of the cells, the above protocols were used to h mologously 
target endogen us DNA sequences in encapsulated nuclei as 
5 follows: 16.5 /xl recA-coated (or non-recA-coated control) 
nick-translated DNA (labeled with biotin-14-dATP) targeting 
polynucleotide was prepared and boxind under standard native 
recA protocols (see W091/17267 and WO93/05178) . Minced agarose 
fragments were centrifuged and New Buffer supernatant removed. 

10 The fragments were resuspended in 1 X AC buffer in a 1.5-ml 
Eppendorf tube, then centrifuged for removal of the buffer 
(leaving an estimated 50 to 75 fil of buffer) , and prepared 
targeting polynucleotide was mixed with the fragments of 
agarose-containing nuclei. Reactions were incxibated in a 37 'C 

15 water bath for 2 to 4 hr, then washed, incubated in standeurd 
preblock solution, then in preblock supplement with 10 fig/nl 
FITC-avidin (Vector, DCS grade) , and again washed. 
Experimental results were analyzed by placing a minute amo\int 
of a reaction with 3 to 4 aiI antifade on a slide with a slide 

20 cover and viewing it by using the Zeiss CLSM-10 confocal laser 
scanning microscope (CLSM) . Completed reactions were also 
stored refrigerated for later examination. 

In the first in vivo experiment, metabolically active 
HEp-2 cells suspended in 1 x PBS were encapsulated in agarose 

25 by gentle vortexing, treated using New Buffer protocols, then 
incubated for 3 hr 15 min with 100 ng of recA-coated targeting 
polynucleotide specific for Chromosome 1 alpha-satellite DNA 
biotinylated with bio-14-dATP by nick translation (BRL, Nick 
Translation System) using pUC 1.77 plasmid DNA (a 1.77 kb long 

30 EcoRI fragment of human DNA in the vector pUC9; Cooke et al. 

(1979) Nucleic Acids Res. 6: 3177; Emmerich et al. (1989) Exp. 
Cell. Res. 181 : 126) . We observed specific targeting by the 
alpha-satellite targeting polynucleotide to pericentromeric 
chromosome 1 targets in intact nuclei of metabolically active 

35 cells. The number and types of signals were essentially 

identical to th se using the same targeting polynucleotide with 
methanol (or ethan 1) -fixed HEp-2 cell targets in suspension. 
Figures lA-lD and 2A-2C show specific targeting signals in 



SUBSTITUTE SHEET 



wo 93/22443 PCr/US93/03868 . 

43 

metabolically active cells from this experiment. 

In th second in vivo experiment, cells suspended in 
inc mplete DMEM m dia instead of 1 x PBS were encapsulated in 
agarose and treat d with 62.5 ng of the same targeting 
5 polynucleotide used in the first experiment described above and 
62.5 ng of a freshly biotinylated targeting polynucleotide 
prepeired under the same protocols. In this experiment, the 
minced agarose fragments were not resuspended in 1 x AC buffer 
before addition of targeting polynucleotide and some nuclei 

10 disintegrated, especially with subsequent centrifugation. The 
results show that in the nuclei that remained intact, the 
targeting polynucleotides coated with recA specifically 
teirgeted predetermined human DNA targets. In contrast, 
targeting polynucleotides in reactions without recA did not 

15 correctly target the predetermined human DNA sequences. When 
the targeted DNA (gienerated from the recA-coated targeting 
polynucleotides) was decondensed from the nuclei, the 
alpha-satellite repeat sequences showed precise and evenly 
spaced signals along the "string" of the alphoid satellite DNA 

20 sequences. 

Thus, the recA-coated targeting polynucleotides were 
targeted to the repetitive alpha satellite sequences of 
Chromosome 1. This result showed DNA targeting in intact 
nuclei to specific human Chromosome 1 sec[uences. An example of 

25 the experimentally extended DNA with specific alpha-satellite 
signals appears in Figures 3A and 3B. 

In the third experiment, cells were suspended in 1 x 
PBS or in incomplete DMEM media before vortexing with agarose 
and were tested using 62.5 ng of targeting polynucleotide in 

30 reactions with and without recA protein. In addition, the. 
reactions were divided in half and washed and FITC-avidin 
treated in either buffer adjusted to pH 7 or pH 7.4. Cells 
were incubated with the recA coated targeting polynucleotide 
for 3 hr 25 min. Live nuclei treated with targeting 

35 polynucleotid alone without recA showed no signals. In th 

recA-treated reactions, relatively weaker signals were observed 
in nuclei incubated in 1 x PBS, wher as very strong specific 
signals were present in nuclei that had been incubated in 
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inc mplete DMEM. Ther was clearly significantly more signal 
present in nuclei that wer washed and treated with FITC-avidin 
at pH 7.4 compared with nucl i incubated at pH 7.0. Figures 5A 
and 5B show nuclei that were treated with recA coated targeting 
5 polynucleotides and incubated at both pH 7.4 and 7.0. 

In a fourth experiment, HEp-2 cells were embedded in 
agarose prepared with ix PBS, New Buffer treated, then treated 
with 100 ng of biotinylated targeting polynucleotide 
complementary to Chromosome 1 alpha-satellite DNA. Controls in 
10 this experiment also included reactions without recA protein 

and additional control reactions supplemented with an identical 
amount of BSA protein to replace the recA protein. 
Additionally, cells were also embedded in agarose prepared with 
IX AC buffer. Examples of specific targeting to endogenous 
15 target sequences were "recorded. 

In a fourth experiment, we directly determined if the 
embedded nuclei (under the conditions used above) were 
metabolically active. The nuclei in agarose were incubated 
with bio-.21-rUTP in complete medium, then incubated for 2 days 
20 in the humidified CO^ atmosphere. Bio-21-rUTP was incorporated 
in RNA. After 2 days at 37' the cells were incubated with 
FITC-streptavidin and examined. FITC was specifically 
associated with nucleoli indicative of ribosomal KNA 
biosynthesis, thus directly showing metabolic activity in these 
25 human cells. Similar results were obtained using DNA 

precursors to measure DNA synthesis. In this experiment it was 
clear that the majority of nuclei in the PBS agarose reaction 
had condensed chromosomes. There was nuclear division in a 
number of these nuclei also, indicative of full metabolic 
30 viability, which was also shown in the AC buffer-treated cells. 

A fifth experiment was performed using, again, HEp-2 
cells embedded in agarose. Final concentration of the cells in 
agarose was 3.7 x 10^ /ml. The cells were suspended in 1 x PBS 
prior to combining with agarose. The final agarose 
35 concentration was 0.5%. There were tw reactions, one in which 
recA was used to coat targeting polynucleotid , the s cond in 
which recA protein was replaced by BSA at the same protein 
concentration followed by New Buffer treatments to remove the 
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' cyt plasm. The nuclei in agarose were incubated f r 3 hr wxth 
targeting polynucleotide, then processed for detection of 
c rrectly targeted polynucleotid using the prot cols describ 
previously. FITC-avidin was used to visualize the biotinylated 
5 targeting polynucleotide at a concentration of 20 /xg/ml. 

Results showed that cells with the recA-coated complementary 
targeting polynucleotide displayed specific signals in 25% or 
more of the intact nuclei. In contrast, the BSA-treated 
controls did not show any signal. 
0 cells in agarose from this experiment were further 

incubated at 37 'C in the COj incubator in complete medium. At 
22 hr, these cells were metabolically active. Chromosomes were 
condensed, and a number of nuclei were in the process of 
dividing, in these experiments, a. significant number of the 
15 cells incubated with recA-coated complementary targeting 

polynucleotides showed specific signal, whereas 0% of the cells 
incubated with targeting polynucleotide alone showed specific 
signal. 

In summary, recA-coated biotinylated targeting 
20 polynucleotides for human chromosome 1 alpha-satellite DNA were 
specifically targeted to human HEp-2 epithelial carcinoma 
chromosomal DNA in intact cell nuclei of metabolically activ 
cells that had been suspended in agarose, then treated with 
buffers and recA-coated targeting polynucleotides under 
25 suitable reaction conditions (supya and W091/17267 and 

WO93/05178). specific binding by the recA-coated targeting 
polynucleotide to chromatin alpha-satellite DNA was observed 
only in the agarose embedded nuclei which were incubated with 
recA-coated targeting polynucleotides. Control nuclei 
30 incubated with targeting polynucleotides in the absence of recA 
and/or with nonspecific protein exhibited no signal. 

yaraetino of Hur ^n p''^ Gene 
We performed recA-mediated homologous targeting of 
35 biotinylated targeting polynucleotides that were homologous to 
the hukan p53 tumor supressor gene, and compared the results to 
targeting of alpha satellite DNA sequences in human chromosom 
1. In these exp^iments, exponentially growing cells were 
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trypsinized, washed, suspended in inc mplete medium and 
encapsulated in agarose. The agarose was minced into pieces 
with a raz r blade and the encapsulated cells were treated with 
New Buffer. A sample from each group was removed to verify 
that nuclei were intact. 

Nuclei were washed in 1 x AC buffer and incubated 
with recA-coated complementary single-stranded DNA 
oligonucleotides (i.e., exogenous targeting polynucleotides) 
for 3.5 hours at 37®C. The alpha satellite DNA targeting 
polynucleotides for chromosome 1 were previously described and 
were nick-translated with biotinylated deoxyribonucleotides 
(bio-14-dATP) . The p53 tumor suppressor gene polynucleotide 
was obtained from Oncor (209 Perry Parlcway, Gaithersburg, MD 
20877) and is a 1.2 kilobase cDNA fragment from a wild-type 
human p53 gene (Fields and Jang, (1990) gcj^ence ZAU 1046; 
Miller et al. (1986) Nature 211'' 783; Zakut-Houre et al. (1985) 
EMBo J. 4.: 1251). The 1.2 kilobase human p53 DNA was nick- 
translated with biotinylated deoxyribonucleotides and yielded a 
population of biotinylated targeting polynucleotides having a 
size range (about 100 to 600 nucleotides) similar to that 
obtained for the human chromosome 1 alpha .satellite targeting 
polynucleotides. The targeting polynucleotides were separately 
incubated with encapsulated cells. Following incubation 3 
washes of 1.75 x SSC were done, and sampled nuclei were 
verified as intact after the washing step. After washing, the 
targeted encapsulated cell nuclei were incubated in preblock 
and FITC-avidin was added to preblock buffer to a final 
concentration of 20 ng/nl for 15 minutes in the dark. The 
targeted encapsulated cell nuclei were washed sequentially in 4 
X SSC, 4 X SSC with 0.1% Triton X-100,, and then 4 x SSC. 
Samples of nuclei were again taken and used to verify that the 
targeted nuclei were metabolically active. Microscopic 
examination showed that metabolically active cells contained 
specific FITC-targeting polynucleotide: targeted endogenous 
sequenc complexes (shown in Figure 4) . The p53 targeting 
polynucleotides were specifically targeted to human chromosome 
17, the location of the endogenous htiman p53 gene sequences, 
indicating sp cif ic pairing of a targeting polynucleotide to a 
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unique endogenous DNA target sequence. The human chromosome 1 
alpha satellite DNA was also specifically targeted to the 
chromos me 1 pericentromeric satellite sequences. 

Th experiments validated a highly specific DNA 
targeting technique for human or other cells as evidenced by 
homologous sequence targeting techniques in metabolically 
active cells. The targeting technique employs the unique 
properties of recA-mediated DNA sequence targeting with 
single-stranded (complementary) short targeting 
polynucleotides. Native intact nuclei were incubated with 
labeled, heat-denatured targeting polynucleotides coated with 
recA protein. The DNA hybridized to the predetermined targeted 
homologous sequences. In these experiments, the targeting 
polynucleotides formed paired complexes with specific gene 
sequences within metabolically active cell nuclei. This in 
vivo targeting by recA-mediated homologous targeting 
polynucleotides shows the targeting specificity and therapeutic 
potential for this new in vivo methodology. Application of 
recA or other recombinase-mediated targeting of (complementary) 
ssDNA or denatured dsDNA targeting polynucleotides to 
predetermined endogenous DNA targets is important for human 
gene entry, gene knockout, gene replacement, and gene mutation 
or correction. 

5 EXAMPLE 2 

pnt-rectina a Mutant Gene to Produ c e a Funetaonal Gene Product 
Homologous ly targeted complementary DNA 
oligonucleotides were used to correct 11 bp insertion mutations 
in vector genes and restore vector gene expression and vector 
0 protein function in microinjected mammalian cells. 

Experiments were designed to test whether a 
homologously targeted complementary oligonucleotide targeting 
polynucleotide of approximately 270-280 bp (276-bp) could 
correct an 11-bp insertion mutation in the lacZ gene of a 
15 mammalian DNA vector which encoded a nonfunctional ^- 

galactosidase, so that a corrected lacZ gene encoded and 
expressed a functional enzyme. Functional enzyme 
(jS-galactosidase) was detected by an X-gal assay that turns 
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cells expressing a revertant (i.e., corrected) lacZ gene a blue 
color . 

NIH3T3 cells microinjected with the mutant t st 
vector bearing an 11 basepair insertion in the lacZ coding 
5 sequence do not produce any detectable functional 

/5-galactosidase enzyme. In contrast, cells microinjected with 
' the wild type test vector do produce functional enzyme. 

We obtained the functional lac plasmid pMCllacpa for 
use as a positive control for expression of ^-galactosidase. 
10 pMCllacXpa is the target test mutant plasmid (shown in Figure 
6) . It is identical to pMCllacpa (shown in Figure 7) but has a 
11-bp Xbal linker insertional mutation. This plasmid does not 
express /S-galactosidase activity in mouse NIH3T3 cells when 
introduced by electroporation. It does not produce blue color 
15 in the presence of X-GAL indicative of /3-galactosidase 

production following vector microinjection. Negative controls 
with mock or noninjected cells we also done. Using these 
controls and NIH3T3 cells have no detectable background blue 
staining. 

20 The plasmid pMCllacpa (8.4 kb) contains the strong 

polyoma virus promoter of transcription plus ATG placed in 
front of the lacZ gene. The polyadenylation signal from SV40 
virus was placed in back of the lacZ gene. The plasmid vector 
was pIB130 (shown in Figure 8) from IBI (New Haven, CT) . The 

25 mutant vector pMCllacpa has a 11-bp insertion at the Xbal site. 
This mutation consists of the inserted sequence CTCTAGACGCG 

(see Figure 9) . 

In several control microinjection experiments using 
pMCllacXpa we consistently failed to detect any blue 

30 microinjected cells. In contrast, in various experiments 

approximately 8 to 13% of the 3T3 cells injected with pMCllacpa 
DNA expressed /S-galactosidase as evidenced by their blue col r. 
No cells microinjected with injection buffer alone or mock 
injected were observed as blue. 

35 We synthesis d two 20-bp primers for producing a 

276-bp PCR product (see Figvire 9) f r m the wild-type lacZ 
sequence for use as targeting polynucleotides. We chose this 
fragment to span the 11 bp insertion mutation as a 
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nonh mologous sequence. The 276-bp DNA oligonucleotide was 
separated by gel electrophoresis and electroeluted from 
agarose, ethanol precipitat d, and its concentration determined 
by absorbance at 260 nm. The fragment was 5« end-labeled with 
5 32p and specifically D-looped with the pMCllacXpa or pMCllacpa 
plasmid DNA using recA as shown by agarose gel electrophoresis. 

Experiments were designed to test for B-galactoside 
production in cells microinjected with pMCllacXpa vectors with 
targeting polynucleotide-target complexes using complementary 
10 276-bp oligonucleotide targeting polynucleotide treated with 
recA. The 276-mer targeting polynucleotides in 1 X TE buffer 
were denatured by heating at 100 'C for 5 min and immediately 
quenched in an ice bath for 1 min. The DNA solution was 
collected at 4*C by centrifugation. RecA-mediated targeting 
15 polynucleotide reactions containing a final volume of 10 ^1 

were assembled using 1.0 ^1 1°^ AC buffer, 1.5 /tl 16 ATP7S, 
3.8 /tl dd HjO, 1.2 ftl i"ecA protein solution (13 iig/nl) , and 2.5 
/tl of a 30 Mg/ml stock of heat-denatured 276-bp targeting 
polynucleotide. The recA protein was allowed to coat the DNA 
20 for 10 min at 37*0. Next, 1.0 fil of lOx AC buffer, 1.0 /xl of 
0.2 M magnesium acetate, 1.3 til of pMCllacXpa (1.0 fig/itl), and 
6.7 nl of dd HjO was added to a final volume of 20 nl. Control 
reactions were performed without added recA protein. 

NIH3T3 cells were capillary needle microinjected with 
25 targeting polynucleotide-target DNA mixtures loaded in glass 
pipettes freshly pulled into microneedles using a Sutter 
instrvunents microprocessor controlled apparatus. An ECET 
Eppendorf microinjection pump and computerized micromanipulator 
were used for computer-assisted microinjection using an Olympus 
30 IMT-2 inverted microscope. Cells were carefully microinjected 
under controlled pressure and time. NIH3T3 cells injected with 
pMCllacpa showed approximately 9% of the injected cells were 
blue. None (0%) of the cells injected with pMCllacXpa DNA in 
reactions containing the 276 bp oligonucleotide but without 
35 recA protein showed a blue color. In marked contrast, 
approximately 1% of the cells microinjected with the 
recA-m diated 276-bp targeting polynucleotide targeted to the 
pMCllacXpa target hybrid were blue. Thus, these measurements 
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indicate that the mutant pMCllacXpa gene can be targeted and 
corrected by the 276-bp olig nucleotide, which has been 
targeted with recA-coated targeting polynucleotides. In 
summary, these measurements show that the 11 bp Xba I insertion 
mutation can be corrected with the recA-mediated targeted 
corrected in vivo, but not with the 276-bp oligonucleotide 
alone. Note that the in situ identification of 3T3 cells 
expressing /5-galactosidase was performed following incubation 
with X-gal (5-bromo-4-chloro-3-indolyl-/S- galactopyranoside) 
(Sigma), as described by Fischer et al. (1988) Hatuse 132: 853; 
Price et al. (1987) ?^r.n IJ^i.^ . Arf^d. SCi . (U.S.A.) 84: 156; 
Lim and Chae (1989) pi otechniques 7: 576, 

Table 1 shows the results of an experiment 
demonstrating the effect of recA coating of targeting 
polynucleotides on efficiency of gene correction. The 
combination of the 276-bp targeting polynucleotide and recA 
(Sample 5) produced a significant frequency of correction of 
the mutant /S-galactosidase gene (3.6%) as compared to samples 
without recA. 

TABLE 1 

rnPPFrTTON OF AN TNSERTT OW MDTATTON IN THE 
^-GATACTOSTni^ST: GENE OF EUKARYP TTP, TCXPRESSTON VECTOR 

pMCllacXpa WTTH RecA-gQATED NQPMAT. 276-MER DNA^ 



Sample 



Injected Plaemid, 
276-mer DNA £Uid 
RecA Protein 



PSV-p-gal-276- 
mer-RecA 



pMCllacpa-276- 
mer-RecA 



pMCllacXpa-276- 
mer-RecA 



pMCllacXpa+276- 
mer-RecA 



Number of 
Injected 
Surviving 
cells* 



168 



98 



173 



103 



Number of 

Surviving 

CellB 

Scoring 

Blue 



21 



Surviving 

CellB 

Scoring 

Blue 

(») 



12.5 



9.2 



Level of 
Signifi- 
cance 
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10 



15 



20 



25 



30 



35 




40 



• HIH 3T3 cellB were needle nicroinjected with five types of P^^^Jf ' 
«?LSdB with a wild-type p-galactosidas gene tpMCllacpa o'/SV-J-gal 

S^laJf StS=~1.«.S w!M';i.»i<.= i. .How„ for c».p«i.o„. 

» on .verage, about 50» of th. total .leroinjootod oell. .orvl«d. 
' Pive eelXa were dark blue and one vas light blue. 

' The nuntoer. of Burrirlna cells Bcorin, blue In the Butmt plasmid RecA- 
SL?^»d non-BecA-trlated ..»plea (3,4 •?«'=> SJ^f^Sf'^^A^.ted 
fourfold x' teate. The frequency of corrected cella in 

:rtJri;Tiri?i^cSS^^^^^^^ 

^on^ined and co»pared with Sample S, the frequency of corrected cells in 

?SrScA-treated sample S i« ^ f 99>X J^^ 

combined sample at the 0.1% significance level (X - 9.99>X 0.001) • 
are statistically significant data. 

Fig. 13 shows the relationship between the number of plasmids 
microinjected per cell and the correction efficiency, and shows 
the efficiency increase in gene correction which occurs with 
the RecA-coated targeting polynucleotide as compared to the 
equivalent polynucleotide which is not coated with RecA. 



EXAMPLE 3 

rn-rr-eetinq a Human rFTR Disease Allege 
Homologously targeted complementary DNA 
45 oligonucleotides were used to correct a naturally occurring 3 
bp deletion mutation in a human CFTR allele and restore 
expression of a functional CFTR protein in targeted mammalian 
cells. 

A major goal of cystic fibrosis (CF) gene th rapy is 
50 the correction of mutant portions of the CF transmembrane 

conductance regulator (CFTR) gene by replacement with wild-type 
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DNA sequences to restore the normal CFTR protein and ion 

transport function. Targeting polynucleotides that were coated 

with recA protein were introduced into transformed CF airway 

epithelial cells, homozygous for both alleles aF508 CFTR gene 

mutation, by either intranuclear microinjection, 

electr operation, or by transfection with a protein-DNA-lipid 

complex. 

Isolation and characterization of the CFTR gene 
(Rommens et al. (1989) Science lil: 1059; Riordan et al. (1989) 
Science 245 ; 1066, incorporated herein by reference) has been 
crucial for understanding the biochemical mechanism (s) 
underlying CF pathology. The most common mutation associated 
with CF, a 3-base-pair, in- frame deletion eliminating a 
phenylalanine at amino acid position 508 (AF508) of CFTR, has 
been found in about 70% of all CF chromosomes (Kerem et al. 
(1989) Science 245 : 1073; Kerem et al. (1990) Proc. Natl. Acad. 
sel. ru.s.A.V £7: 8447) . Correction of aF508 and other CFTR 
DNA mutations lies at the basis of DNA gene therapy for CF 
disease. Elimination of the cAMP-dependent CI ion transport 
defect associated with CFTR gene mutations has been 
accomplished through the introduction of the transcribed 
portion of wild-type CFTR cDNA into CF epithelial cells (Rich 
et al. (1990) Nature 347 ; 358; Drumm et al. (1990) Cell 62: 
1227) . 

An immortalized CF tracheobronchial epithelial human 
cell line, SCFTE290-, is homozygous for the aF508 mutation 
(Kunzelmann et al. (1992) Am. J. Respir. Cell. Mol. Biol., in 
press) . These cells are useful as targets for homologous 
recombination analysis, because they contain the same 3 
basepair deletion in CFTR allele on all copies of chromosome 7. 
Replacement of the aF508 allele with wild-type CFTR DNA is 
indicated only when homologous recombination has occurred. The 
491 bp region of the CFTR gene spanning exon 11 and containing 
3' and 5« flanking intron sequences was selected from sequence 
data published previously (Zielenski et al. (1991) Genoi;tj.cs 10: 
214, incorporated herein by reference) and used as a targeting 
polynucl otide. The DNA fragment was PGR amplified in 
preparative quantities and then denatured for introduction into 
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cells as recA-coated complementary ssDNA (or dsDNA) . 
Exponentially growing cells were transf ected by intranuclear 
microinjection and were propagated on the same petri dishes in 
which they were microinjected. Cells outside the microinjected 
5 area were removed by scraping with a rubber policeman. 

Exponentially growing cells were typsinized and washed before 
electr operation. Cells transf ected with protein-DNA-lipid 
complexes were grown to approximately 70-80% confluence before 

transf ection . 

The 491 bp fragment was generated by PCR 
amplification from the T6/20 plasmid (Rommens et al. (1989) 
on.eit. . incorporated herein by reference) and verified by 
restriction enzyme mapping and propagated as described 
previously. After digestion with EcoRI and Hindlll, a 860 bp 
15 insert was isolated following electrophoresis in 0.8% SeaPlaque 
agarose gel. The 860 bp fragment contained CFTR exon 10, as 
well as 5' and 3« intron sequences, as defined by the 
restriction enzyme cleavage sites (Zielenski et al. (1991) 
ep.cit. ) . A 50 ng aliquot of the fragment was amplified by PCR 
20 using primers CFl and CF5 (Table 2) to generate a 491 bp 

fragment. The conditions for amplification were denaturation, 
94« C for 1 min; annealing, 53» C for 30 sec; extension, 72« C 
for 30 sec with a 4 sec/cycle increase in the extension time 
for 40 cycles. The fragment size was confirmed by 
25 electrophoresis on a 1% agarose gel, then amplified in bulk in 
20 separate PCR amplifications, each containing 50 ng of target 
DNA. The 491 bp PCR products were extracted with 
phenol:chloroform:isoamyl alcohol (25:24:1) extraction and 
precipitated with ethanol. DNA precipitates were collected by 
30 centrifugation in an Eppendorf microcentrifuge and resuspended 
at a final concentration of 1 mg/ml. The 491 bp fragment 
contained exon 10 (193 bp) , as well as 5- (163 bp) and 3' (135 
bp) flanking intron sequences, as defined by primers CFl and 
CF5. 

35 The 491 nucleotide fragments were coated with recA 

protein using the reaction buffer of Cheng (Cheng, et al. 
(1988) -T. Biol.. Chem. 263: 15110, incorporated herein by 
reference) . Typically, the 491 bp DNA fragment (5/ig) was 
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denattired at 95^ C f r lO nin, then added to a 63 /il of coating 
buffer containing 200 /xg of recA protein, 4.8 nM ATP7S, and 1.7 
/xl reaction buffer (100 mM Tris-Ac, pH 7.5 at 37® C; 10 mM 
dithiothreit 1; 500 mM NaAc, 20 mM MgAc, 50 percent glycerol) 
5 and incubated for 10 min at 37® C. Next, the HgAc 

concentration was increased to a final concentration of about 
22 mM by addition of 7 m1 of 200 mM MgAc. Under these 
conditions, the 491 nucleotide fragment was coated with recA 
protein at a molar ratio of 3 bases per 1 recA molecule. After 

10 coating the fragments were immediately placed on ice at 4® C 
until transfection (10 min to 1 hr) . 

Microinjection, when used, was performed with an 
Eppendorf 5242 microinjection pximp fitted to an Eppendorf 5170 
micromanipulator using borosilicate pipettes (Brunswick, 1.2 

15 OD X 1.9ID) fabricated into a microneedle with a Sutter 

Instruments (P-87) micropipette puller. The micropipettes were 
filled by capillary force from the opposite side of the needle. 
Approximately 100 pipettes were used for injecting of 4000 
cells. Cells were injected with approximately 1,000-10,000 

20 fragments per cell by intranuclear injection with 120 hPa for 
0.1-0.3 s at a volume of 1-10 fl/nucleus. Microinjected cells 
were viewed with an Olympus IMT-2 inverted microscope during 
the injection. The area of the petri dish containing injected 
cells was marked with 2 to 5 mm diameter rings. Needle 

25 microinjection was performed in cells grown on 10 separate 60 
mm petri dishes. Cells were injected at room temperatxire in 
culture medium after two washes in phosphate buffered saline 
(PBS) . After microinjection, noninjected cells in the culture 
were removed by scraping. Injected cells were grown at 37** C 

30 in a humidified incubator at 7 days and then harvested for DNA 
and RNA. 

Electroporation experiments were performed using 
recA-coated 491-mer ssDNA as described above. Approximately 1 
X 10^ exponentially growing cells were suspended in 400/xl of 
35 coating buffer with 5 ^g of recA coated-DNA. The cell 
suspehsi n was pre- incubated on ice f r 10 min and 
electroporated at r om temperature with 400 V and 400 /iF in a 
BTX 300 electroporator (BTX Corporation, San Diego, CA) . After 
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electrop ration, cells wer incubated on ice for an additional 
10 min, diluted in Eagle's minimal essential medium (MEM) 
supplemented with 10% fetal bovine serum (FBS) and 100 /ig/ml 
streptomycin, 100 U/ml penicillin (Cozens et al. (1992) £roc^ 
5 ^atl. Acad- sci. fU.S.A.) 89: 5171 ; Gruenert et al. (1988) 
pr-ne. Natl. Anad. Sci, fU.S.A.) 85: 5951; Kunzelmann, (1992) 
op.cit. ). and then seeded in T75 flasks. Under these 
conditions of elecroporation, approximately 30-50% of the cells 
sTirvive. Cells were cultured for 507 days at 37» C and then 
10 harvested for DNA and RNA. 

Protein DNA-lipid complexes (liposomes) were 
prepared. Briefly, dioleoylphosphatidylethanolamine (PtdEtn, 
DOPE) was used for preparing liposomes by drying 4 /xM solutions 
of the lipid under nitrogen at room temperature. The lipid 
15 film was rehydrated with 4 ml of 30 mM Tris-HCl buffer (pH 9), 
then sonicated for 15 minutes under an atmosphere or argon. 
The protein-DNA complex was prepared in polystyrene tubes by 
diluting 20 ng of recA-coated 491-base DNA in 30 mM Tris-HCl, 
(pH 9) buffer. Protein (gramicidin S peptide, GmS: Legendre et 
20 al. (1993) PT-ne. Natl. r^n^A. fie^ . (JJ.S.A,) SO: 893) was also 
diluted with 30 mM Tris HCl (pH 9) to a final concentration of 
2 mg/ml from a 20 mg/ml stock solution prepared in dimethyl 
sulfoxide. The protein, GmS, (40 ng) was added to the DNA and 
rapidly mixed. Next, 175 nl of the liposome solution (175 
25 nmoles of lipid) were added to the peptide DNA mixture, forming 
a gramicidin S peptide DNA lipofection complex. 

Genomic DNA was isolated and pxirif ied from cells as 
described in Maniatis op.cit. to test for homologous DNA 
recombination. Cellular DNA was first PCR-amplif ied with 
30 primers CFl and CF6 (Table 2). CFl is within the region of 

homology defined at the 5' end of the 491 bp CFTR fragment CF6 
is outside the region of homology at the 3' end of this 
fragment. 

The conditions for the PGR amplification were as 
35 follows: CF1/CF6; 684/687 bp fragment; primers, 0.5 tM-, DNA, 
1-2 ngj denaturation; 94'C for 1 min; annealing; 53»C for 45 s; 
extension; 72»C for 90 s with a 4-s/cycle increase in extension 
time for 40 cycles; Mg+2 1.5 mM. DNA fragments were separated 
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by agarose electr phoresis and visualized by staining with 
ethidium bromide, then transferred to Gene Screen Plus filters 
(DuPont) . The DNA was then hybridized with the allele-specif ic 
normal CFTR 32p_end-labeled DNA probe defined by oligo N as 

5 described by Cozens et al. (1992) op.cit..; Kunzelmann (1992) 
op.cit. . incorporated herein by reference. The presence of 
wild-type (WT) sequences was determined autoradiographically by 
hybridization with the radiolabeled DNA probe: 

Homologous recombination was verified in a second 

0 round of PCR DNA amplification using the 687/684 bp fragment as 
a DNA template for amplification. The primers used in this 
allele-specif ic reaction were CFl and the oligo N or oligo aF. 
The size of the DNA fragments was 300 bp (oligo N) or 299 bp 
(oligo aF) . 

.5 The conditions for the reaction were as follows: 

CFl/oligo N/aF; 300/299 bp ftagment; primers, 0.5 iMj DNA, 1-2 
figj denatviration, 95«»C for 45s; annealing, 51'C for 30s; 
extension, 72 'C for 30 s with a 3-s/cycle increase in extension 
time for 40 cycles; Mg*^, 1.5 mM. In DNA from transfected 
0 2CFTE290- cells, amplified with the CFl/oligo N primers, a PCR 
product was detected only if the wild-type CFTR sequences were 
present. Amplification with the CFl/oligo aF gives a PCR DNA 
product of DNA targets purified from transfected and 
nontransfected LCFTE290- cells but not for DNA targets isolated 
25 from control normal cells (16HBE140-) . The presence of wild- 
type CFTR sequences in the amplified DNA fragments was also 
determined autoradiographically after hybridization with ^Zp- 
5 • -endlabeled oligo N as probe. 

Cytoplasmic RNA was isolated and denatured at 95 "C 
30 for 2 min, then reverse-transcribed using the DNA polymerase 
provided in a PCR RNA Gene Amp kit according to manufacturer's 
instructions (Perkin-Elmer/Cetus) . First strand cDNA was 
amplified by using primer CF17 at the 5' end of exon 9 and the 
allele-specif ic oligo N or oligo aF primers. The length of the 
35 PCR fragments is 322 bp (CF17/oligo N) and 321 bp (CF17/oligo 
aF) . 

The conditions for PCR amplification are CF17 /oligo 
N/aF, 322/321 bp fragment; primers, 1 tM; denaturation, 94 "C 
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for 1 min; annealing, 51- C for 30s; extension, 72»C for 20s 
with a 4-s/cycle increase in extension time for 40 cycles; 
Mg*^, 0.8 nM. DNA fragments were visualized after 
electr phoresis on ethidivim bromide-stained 1% agarose gels. 
5 In addition to the allele-specific PGR amplification of first- 
strand cDNA, Southern hybridization was performed as described 
above. Fragments were transferred to Gene Screen Plus filters 
then hybridized with allele-specific oligo N probe under the 
same conditions used for the Southern analysis of the genomic 
10 DNA (Kunzelmann et al. (1992) op.cj.t. ; Cozens et al. (1992) 

op.cit. ^ . The presence of wild-type CFTR RNA was confirmed by 
hybridization and autoradiography of RNA extracted from normal 
(16HBE140-) control DNA and in DNA of transfected ECFTE290- 
cells. 

15 Hybridization was performed as described previously 

(Cozens et al. (1992) op.eit. ^ . DNA fragments were separated 
by agarose gel electrophoresis. DNA was denatured with 0.4 N 
NaOH and 0.6 M NaCl for 30 min, then washed once with 1.5 M • 
NaCl and 0.5 M Tris-HCl for 30 min. DNA was transferred to 

20 Gene Screen Plus membrane (NEN-DuPont) by capillary blot, again 
denatured with 0.4 N NaOH for 1 min, and then neutralized with 
0.2 M Tris-HCl (pH 7.0). DNA on membranes was prehybridized 
for 1 h at 37 'C in 6 X SSC, 5 X Denhardt's solution, 1% SDS, 
containing 100 ng/ml of denatured salmon sperm DNA (Sigma) . 

25 Oligonucleotide probes (oligo N or oligo aF; 10 ng) were ^^P- 
5 • -endlabeled with 20 units of T4 kinase and 40 fiCi 32p_^-atp 
for 30 min at 37 •C. Unincorporated nucleotides were removed by 
centrifugation of the reaction mix through a minispin column 
(Worthington Biochemical Corp. , Freehold, NJ) . Hybridization 

30 was performed overnight at 37 "C. Membranes were washed twice 
for 5 min each time in 2 x SSC at room temperature, twice for 
30 min in 2 x SSC, 0.1% SDS at 45''C, and once in 0.1 x SSC for 
30 min at room temperature. , After washing, hybrids on 
membranes were analyzed autoradiographically by exposure to 

35 x-ray film. 

Analysis of ZCFTE29o- DNA sh ws replacement of the 
endogenous mutant (aF508) sequences with the exogenous normal 
fragment as evidenced by PCR amplification of genomic DNA and 
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on 



allele-specific Southern blot hybridization. PCR primers < 
inside (CFl) , and one outsid (CF6) the region of homology (491 
bp) , wer used to test whether the amplified DNA band was 
possibly due to amplification of any residual DNA fragment 
5 remaining in the cell after the transfection or by possible 

random DNA integration. A 687 bp fragment contains normal CFTR 
* sequences while the 684 bp fragment is generated from aF508 
CFTR DNA. To determine whether endogenous aF508 sequences were 
replaced with exogenous normal CFTR sequences, we analyzed 
10 aliquots of the 687 or 684 bp amplification fragments by 
southern hybridization using 32p_end-labeled DNA probes 
specific for the aF508 or wild-type sequences (Table 2) . In 
addition, the 687 bp fragment was PCR amplified by using the 
CF6 primer and a primer specific for either aF508 (oligo aF) or 
15 normal sequences (oligo N) . The second round of DNA 
amplification with the CFl/oligo N or aF primer pair 
combination yields 300/299 bp fragments, respectively. With 
the CFl/oligo N primer pair combination, a fragment will be • 
detected only if the mutant DNA has been replaced by normal 
20 sequences. Further confirmation of homologous DNA 

recombination was tested by allele-specific Southern blot 
hybridization of the 300/299 bp fragments 

Analysis of cytoplasmic RNA to detect normal exon 10 
sequences in CFTR mRNA, verify that the homologous DNA 
25 recombination was legitimate and that normal CFTR mRNA is 

expressed in the cytoplasm. To test whether the PCR generated 
DNA fragments were exclusively CFTR mRNA-derived, primers in 
exon 9 (CF17) and allele-specific (normal, oligo N or aF508, 
oligo AF) primers in exon 10. This amplification with primers 
30 CF17/N yields a 322 bp normal fragment only if transcription of 
homologously recombined DNA has occurred. A 321 bp DNA 
fragment would be generated if the aF508 mutation were present. 
Furthermore, Southern hybridization analysis with allele- 
specific 32p_end-labeled probes differentiated between normal 
35 and AF508 mutant sequences and were also used to confirm 
expression of wild-type CFTR mRNA in the cytoplasm. 

Homologous recombination between the targeting 
polynucleotide comprising WT CFTR DNA and aF508 mutant cellular 
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DNA allelic targets was evaluated by analysis of cellular DNA 
and RNA isolated from transfected and nontransf ected rCFTE29o- 
cell cultures. Nuclear genomic DNA and cytoplasmic RNA were 
isolated 6 days after transf action, CFTR exon 10 sequ nces were 
5 amplified by PGR. Oligonucleotide primers (Table 2) were used 
to amplify the region of CFTR DNA spanning exon 10. One PCR 
primer (CF 1) was within the region of homology defined by the 
491 bp DNA fragment (sense primer) , and the other (CF 6) was 
outside the homologous region in the 3* intron (antisense 
10 primer) . This DNA amplification reaction produces a 687 bp 

fragment with normal human CFTR DNA or a 684 bp fragment if the 
DNA contains the aF508 mutation, as shown in Fig. lOA. 
Southern hybridization was carried out on the 687/684 bp DNA 
fragments generated from amplification of genomic DNA from cell 
15 cultures by microinjection or by transf action with the protein- 
DNA-lipid complex, shown in Fig. lOB. A probe consisting of 
32p_end-labeled oligonucleotide DNA that hybridized only to DNA 
sequences generated from a normal exon 10 was used. DNA from 
all microinj ected and transfected cells produced specific 
20 hybrids as evidenced by autoradiographic hybridization. For 

cells microinjected with the 491 nucleotide fragment (Fig. lOB, 
lane 2) , the present of normal exon 10 sequences indicated 
homologous replacement at at least a frequency of ^ 2.5 x lO"*. 
This result indicates at least one correctly targeted 
25 homologous DNA replacement in about 4000 microinjected nuclei. 
Other similar experiments using either electroporation or 
protein-DNA-lipid transfection to transfer the recA-coated 491 
nucleotide CFTR DNA fragments also showed homologous 
recombination with the normal CFTR sequence in transfected CF 
30 cells. No hybridization was observed in control nontransf ected 
(or mock- injected 2CFTE290- cells). In each cell transfected 
with normal CFTR DNA, analysis of the genomic DNA in a second 
round of allele-specif ic amplification of the 687/684 bp 
fragments with primers CFl/oligo N (Table 2) clearly showed the 
35 300 bp fragment expected when wild-type CFTR sequences are 
present, as shown in Fig. IIA. Fragments were detected for 
control 16HBE140- cells (Fig. IIA, lane 2) and cells 
transfected with recA-coated DNA (Fig. IIA, lanes 5 and 6) . A 
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299 bp fragm nt (AF508-sp cific primer ends one base closer to 
the CFl than the oligo N) was det cted in DNA from 
nontransfected 2CFTE29o- cells amplified with CFl/oligo aF 
primers (Fig. IIA, lane 4). No fragment was detected in DNA 
5 from nontransfected LCFTE29o- cells reamplified with the 
CFl/oligo N primers (Fig. IIA, lane 3). Allele-specif ic 
Southern blot hybridization of these fragments with the 
•^^P-endlabeled oligo N probe resulted in autoradiographic 
hybridization signals from control normal and transfected CF 

10 cells (Fig. IIB, lanes 1, 4, and 5) but not from DNA of 

nontransfected CF cells amplified with CFl and oligo-N or -aF 
(Fig. IIB lanes 2 and 3). We tested whether any residual 491 
nucleotide DNA fragments which might remain in the cell after, 6 
days could act as a primer for the PCR reaction^ genomic 

15 " ZCFi?E29o- DNA was incubated with an equivalent nvunber of recA- 
coated DNA fragments (10^-10^) introduced by microinjection 
(Fig. 12). One antisense primer contains the wild-type normal 
(N) sequence while the other contains the AF508 (aF) mutation. 
Amplification with the CF1/aF primer combination gives a 299 bp 

20 fragments when the aF508 mutation is present. No DNA fragment 
product was detected when the CFl/N primer combination was used 
with control nontransfected ZCFTE290- DNA (Fig. 12^ lane 2). 
However, when the CF1/aF primer combination was used for DNA 
amplification in nontransfected ECFTE29o- cells, a DNA product 

25 of the expected size (299 bp) was produced (Fig. 12, lane 1). 
These results indicate that all residual 491 nucleotide DNA 
fragments which might remain in the cells after 6 days of 
culture were incapable of competing with the CFl PCR primers in 
the PCR amplification of the 687/684 bp fragments. 
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Table 2 

PCR Primers and oligonuc leotides 

DNA Sequence 

5 • -GCAGAGTACCTGAAACAGGA 
5 • -CATTCACAGTAGCTTACCCA 
5'- 

5'- 

5 • -CACCAAAGATGATATTTTC 
5 » -AACACCAATGATATTTTCTT 

15 Notes: 

(1) CFl and CF5 PCR primers were used to synthesiz 
the 491 bp fragment used for the targeting polynucleotide. 

(2) CFl and CF6 PCR primers were used to amplify the 
687/684 bp CFTR fragment. 

20 (3) The CF17 primer is located at the 5' end of exon 

9 and was used for amplification of first strand cDNA derived 

from CFTR mRNA. 

(4) Oligo N and Oligo AF are allele-specif ic probes 
and can also be used as allele-specif ic PCR primers for 

25 amplifying the 300/299 bp fragments (DNA analysis) and the 
322/321 bp fragments (RNA analysis). 

(5) Sense (S) and antisense (A) primers are 
designated under DNA Strand and indicate the sense of the 
strand relative to the transcribed direction (i.e., the CFTR 

30 iQRNA) . 



5 m iaonucleotide DNA Strand 

CFl S 

CF5 A 

CF6 A 
CCACATATCACTATATGCATGC 

10 CF17 S 
GAGGGATTTGGGGAATTATTTG 

OLIGO N A 

OLIGO AF A 
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The corrected CFTR DNA must also be expressed at the 
inENA level for normal function to be restor d. Therefore 
cytoplasmic CFTR mRNA was analyzed for th presence of a normal 
CFTR RNA sequence in the aF508 regi n of exon 10. Cytoplasmic 
5 RNA was isolated from the cells, reverse-transcribed with DNA 
polymerase and PCR-amplif ied as first-strand cDNA. This 
amplification was performed with a PGR primer located in exon 9 
(CF17, sense) and CFTR allele-specif ic PCR primer in exon 10 
(oligo N or aF, antisense) . The exon 10 primer contains the CF 

10 mutation site, and the resulting fragment is 322 bp in normal 
DNA or 321 bp in DNA containing the aF508 mutation. 
Amplification of genomic DNA is eliminated by using primers 
that require amplification across intron/exon boundaries. 
Amplified cDNA generated from normal control 16HBE140- cells 

15 * and experimentally transf ected cells yielded DNA product 
fragments with the CF17/ oligo N, whereas nontransf ected 
LCFTE290- cells only showed a DNA fragment after amplification 
with the CF17/oligo aF primers but not with the CF17/oligo N • 
primers. Cells electr operated with wild-type 491-mer CFTR DNA 

20 showed the presence of wild-type CFTR mRNA. In addition, 
protein-DNA-lipid-transf ected SCFTE290- cell cultures also 
showed the presence of wild-type CFTR mRNA in cells transf ected 
with the recA-coated 491 nucleotide fragment. Southern 
hybridization of the 322/321 bp cDNA fragments with the ^^p- 

25 end-labeled N oligonucleotide DNA probe showed the specificity 
of the PCR amplification and produced specific autoradiographic 
hybridization signals from all cell cultures transfected with 
recA-coated 491 nucleotide targeting polynucleotide. No 
autoradiographic hybridization signals were detected in 

30 nontransfected SCFTE290- cells amplified with the CF17/oligo N 
or oligo aF primers. These analyses verify that the genomic 
DNA homologously recombined with the WT 491-mer DNA at the 
AF508 CFTR DNA locus resulting in RNA expressed and transported 
to the cytoplasm as wild-type CFTR mRNA. 

35 This evidence demonstrates that human CFaF508 

epithelial cells CFTR DNA can homologously recombine with 
targeting polynucleotides comprising small fragments f WT CFTR 
DNA resulting in a corrected genomic CFTR allele, and that a 
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' recA-c ated targ ting polynucleotide can be used in 

Lansfection r actions in cultured hu^an cells, and that cystxc 
fibrosis AF508 mutations can be corrected in genome DNA 
resulting in the production of normal CFTR cytoplasmic mRNA. 
5 Functional analysis of CFTR CI" transport xn mock- 

transfected and transfected cells was performed by measurement 
of 36ci efflux. Following exposure to the cAMP agonist, 
forskolin, a slight increase in the rate of Cl" 
detected in the population of transfected cells. A selection 
10 protocol was developed to select for small subpopulatxons of 
phenotypically normal cells (i.e. , transfected cells having 
functionally normal CI" ion transport) by depleting 
intracellular Ca^^ with a Ca*^ ionophore in a Ca^^-free buffer 
and exposing the cells to hypoosmqlar and then hyperosmolar 
15 buffers. Defective CF cells that had not undergone functional 
correction by homologous recombination with a targeting 
polynucleotide were thus selected against. Measurement of 
36ci- efflux in cells enriched by this selection protocol ■ 

. . ^ . 36pT- following stimulation 

indicated a significant increase m CI following 

20 of cAMP-dependent Cl~ transport. 

Taken together, the data provided indicates that 
small (e.g., 491-mer) ssDNA fragments can find their genomic 
homologues when coated with recA protein and efficiently 
produce homologously targeted intact mammalian cells having a 
25 corrected gene sequence. Analysis of CFTR in cytoplasmic RNA 
and genomic DNA by allele-specif ic polymerase chain reaction 
(PGR) amplification and Southern hybridization indicated wild- 
type CFTR DNA sequences were introduced at the appropriate 
nuclear genomic DNA locus and was expressed as CFTR i« 
30 transfected cell cultures. Thus, in human CF airway epithelial 
cells, 491 nucleotide cytoplasmic DNA fragments can target and 
replace the homologous region of CFTR DNA containing a 3 bp 

AF508 deletion. 

correctly targeted homologous recombination was 
35 detected in on out of one microinjection experim nt with recA- 
coated targeting polynucleotide, two of two different 
electroporation experiments with recA-coated targeting 
polynucleotide, and one of one lipid-DNA-protein complex 
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transfection experiment with recA-coated targeting 
polynucleotide. Taken together, these 4 separate experiments 
strongly indicate that homologous recombination with relatively 
small reoA-coated targeting polynucleotides (491-mer CFTR DNA) 
is feasible for treatment of human genetic diseases, and can be 
performed successfully by using various methods for delivering 
the targeting polynucleotide-recombinase complex. 

Although the present invention has been described in 
some detail by way of illustration for purposes of clarity of 
understanding, it will be apparent that certain changes and 
modifications may be practiced within the scope of the claims. 
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(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 162 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 
ATGATTACGG ATATCGAATT AATACGACTC ACTATAGGGA GATCGAATTC GAGCTCGGTA 
- CCCGGGGATC CTCTAGAGTC GACCTGCACC TGCAGGGGCC CTCGAGACGC GTGGCATGCA 
AGCTTTCTCC CTTTAGTGAG GGTTAATTAT AGGCCTAGCT TG 
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(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2! 
CGTAAGTGAA GCGACCCGCA 
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(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO:3:* 
AGGCCAATCC GCGCCGGATG C 
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(2) INFORMATION FOR SEQ ID NO: 4: 

fi) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 
CGTAAGTGAA GCGACCCGCA TTGACCCTAA CGCCTGGGTC GAACGCTGGA AGGCGGCGGG 
CCATTACCAG GCCGAAGCAC GGTTGTTGCA GTGCACGGCA GATACACTTG CTGATGCGGT 
GCTGATTACG ACCGCTCACG CGTGGCAGCA TCAGGGGAAA ACCTTATTTA TCAGCCGGAA 
AACCTACCGG ATTGATGGTA GTGGTCAAAT GGCGATTACC GTTGATGTTG AAGTGGCGAG 
CGATACACCG CATCCGGCGC GGATTGGCCT 



SUBSTITUTE SHEET 



wo 93/22443 



PCi>«j!syj>iiVJAt)6 



70 



(2) INFORMATION FOR SEQ ID NO: 5: 

(il SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 281 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: • 
CGTAAGTGAA GCGACCCGCA TTGACCCTAA CGCCTGGGTC GAACGCTGGA AGGCGGCGGG 
CCATTACCAG GCCGAAGCAC GGTTGTTGCA GTGCACGGCA GATACACTTG CTGATGCGGT 
GCTGATTACG ACCGCTCACG CGCTCTAGAC GCGTGGCAGC ATCAGGGGAA AACCTTATTT 
ATCAGCCGGA AAACCTACCG GATTGATGGT AGTGGTCAAA TGGCGATTAC CGTTGATGTT 
GAAGTGGCGA GCGATACACC GCATCCGGCG CGGATTGGCC T 



60 
120 
IBO 
240 
281 
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(2) INFOBMATION FOR SEQ ID NO: 6: 

SEQUENCE characteristics: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6! 
CTCTAGACGC G 
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(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 
GCAGAGTACC TGAAACAGGA 
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(2) INFORMATION FOR SEQ ID N0:8: 

fi^ SEQUENCE characteristics: 
^ (A) length: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID 
CATTCACAGT AGCTTACCCA 
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(2) IKFORMMION FOR SEQ ID NO: 9: 

(L) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 
(iv) ANTI-SENSE: YES 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 
CCACATATCA CTATATGCAT GC 
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(2) INFORMATION FOR SEQ ID NO: 10: 

fi^ SEQUENCE characteristics: 

(A) length: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:: 
GAGGGATTTGGGGAATTATT TG 
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(2) INFORMATION FOR SEQ ID NO: 11: 

fi) SEQUENCE characteristics: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (primer) 
(iv) ANTI-SENSE: YES 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 
CACCAAAGAT GATATTTTC 
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(2) INFORMATION FOR SEQ ID NO: 12: 

{L\ SEQOEHCE CHARACTERISTICS: 

(A) LENGTH: 20 base paxrs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: Single 

(D) TOPOLOGV: linear 

(ii) MOLECULE TYPE: DNA (primer) 
(iv) ANTI-SENSE: YES 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12! 
AACACCAATG ATATTTTCTT 
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CLAIMS 



1 A method for targeting and altering, by homologous 

recombination, a pre-select d target DNA sequence in a 
eukaryotic cell to make a targeted sequence modificati n, saxd 
method comprising the steps of: 

introducing into at least one eukaryotic cell at 
least one recombinase and at least one targeting polynucleotide 
having a homology clamp that substantially corresponds to or xs 
substantially complementary to a preselected target DNA 

sequence; and 

identifying a eukaryotic cell having a targeted DNA 

sequence modification at a preselected target DNA sequence. 

"2 A method according to Claim 1, wherein at least two 

targeting polynucleotides which are substantially complementary 
to each other are used. 

3. A method according to Claim 1, wherein said 
recombinase is a specie? of prokaryotic recombinase. 

4. A method according to Claim 3, wherein said 
prokaryotic recombinase is a species of prokaryotic recA 
protein. 

5. A method according to Claim A, wherein said recA 
protein species is E. coli recA. 

6 A method according to Claim 1, wherein said targeting 
polynucleotide is conjugated to a cell-uptake component. 

7 A method according to Claim 6, wherein said cell- 
uptake component is conjugated to said targeting polynucleotide 
by noncovalent binding. 

8. A method according to Claim 6, wherein th cell- 

uptake component comprises an asialoglycoprotein. 



SUBSTITUTE SHEET 



PCr/US93/03868 

WO 93/22443 



79 

9. 



9 A method according to Claim 6, wherein the cell- 
uptake component comprises a pr t in-lipid complex. 

10 A method according to Claim 6. wherein said targeting 
5 polynucleotide is conjugated to a cell-uptake component and to 

a recombinase, forming a cell targeting complex. 

11 A method according to Claim 1, wherein said targeting 
polynucleotide comprises a homology clamp that is complementary 

10 to said preselected target DNA sequence. 

12. A method according to Claim 11, wherein the targeting 

polynucleotide consists of a homology clamp. 

15 13 A method according to Claim 2, wherein a first said 

targeting polynucleotide comprises a homology clamp that xs 
complementary to said preselected target DNA sequence and a 
second said targeting polynucleotide comprises a homology clamp 
that corresponds to said preselected target DNA sequence. 



20 



25 



14. A method according to Claim 13, wherein said first 

targeting polynucleotide consists of a homology clamp. 

15 A method according to Claim 13, wherein the homology 
cllmp of said first targeting polynucleotide and the homology 
clamp of said second targeting polynucleotide are 
complementeiry . 

16 A method according to Claim 2, wherein a first said 
targeting polynucleotide comprises a homology clamp that is 
complementary to a preselected target DNA sequence. 

17 A method according to Claim 16, wherein a second 
targeting polynucleotide comprises a homology clamp that is 

35 complementary to a sequence of said first targeting 

polynucleotide. 

18. A method according to Claim 17, wherein said s cond 
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targeting polynucleotide consists of a sequence that is 
complementary to th complete sequence of said first 
polynucleotide . 

19. A method according to Claim 1, wherein the 
preselected target DNA sequence is unique in a haploid genome 
of said exjkaryotic cell. 

20. A method according to Claim 19, wherein the 
preselected target DNA sequence is unique in a diploid genome 
of said eiikaryotic cell. 

21. A method according to Claim 1, wherein the targeted 
se^ence modification comprises a deletion of at least one 
additional nucleotide. 

22. A method according to Claim 1, wherein the targeted 
seiience modification comprises the addition of at least one - 
additional nucleotide. 

23. A method according to Claim 22, wherein the targeted 
sequence modification corrects a human disease allele in a 
hximan cell. 

24. A method according to Claim 22, wherein the human 
disease allele is a CFTR allele associated with cystic 
fibrosis. 

25. A method according to Claim 1 or Claim 6, wherein the 
recombinase and the targeting polynucleotide are introduced 
into the eukaryotic cell simultaneously. 

26. A method according to Claim 25, wherein the 
recombinase and the targeting polynucleotide are introduced 
into the eukaryotic cell by a method selected from the group 
consisting of: microinjection, electroporation, r contacting 
of the cell with a lipid-protein-targeting polynucleotide 
complex. 
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27. A method acc rding t Claim 1, wherein th targeted 
s quence modification creates a sequence that encodes a 
polypeptide having a biol gical activity. 

28. A method according to Claim 27, wherein the 
biological activity is an enzymatic activity. 



29. A method according to Claim 27 or Claim 28, wherein 
the targeted sequence modification is in a human cell and 

10 encodes a human polypeptide. 

30. A method according to Claim 29,- wherein the targeted 
sequence modification is in a human oncogene or tumor 
suppressor gene sequence. 

15 ^ . 

31. A method according to Claim 30, wherein the targeted 

sequence modification is in a human p53 sequence. 

32. A method according to Claim 29, wherein the targeted 
20 sequence modification is in a human CFTR allele. 

33. A method according to Claim 32, wherein the targeted 
sequence modification occurs in a human cell. 

25 34 . A method according to Claim 1, wherein the targeting 

polynucleotide comprises a homology clamp that is less than 500 
nucleotides long. 

35. A method according to Claim 34, wherein the targeting 
30 polynucleotide is less than 500 nucleotides long. 

36. A composition for producing a targeted modification 
of an endogenous DNA sequence, comprising a targeting 
polynucleotide and a recombinase. 



35 



37. A compositi n according to Claim 36, wherein the 

targeting polynucleotide is noncoyalently bound to said 
recombinase • 
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38. A composition acc rding to Claim 36, fxirther 
comprising a cell-uptake component. 

39. A composition for producing a targeted sequence 
modification of a human disease allele, comprising a targeting 
polynucleotide containing a corrected sequence and a 
recombinase or an expression polynucleotide that encodes and 
expresses a recombinase. 

40. A composition according to Claim 39, further 
comprising a cell-uptake component. 

41. A composition according to claim 39 or Claim 40, 
wherein the human disease allele is a CFTR allele. 

42. A kit for therapy, monitoring, or prophylaxis of a 
genetic disease comprising a recombinase and a targeting 
polynucleotide. 

43. A kit for therapy, monitoring, or prophylaxis of a 
genetic disease according to Claim 42, further comprising a 
cell-uptake component. 

44. A method for treating a disease of an animal 
harboring a disease allele, comprising administering to the 
animal a composition consisting essentially of: 

a recombinase or an expression polynucleotide 
encoding a recombinase; ajnd 

a targeting polynucleotide which produces a sequence 
modification upon homologous recombination with the disease 
allele. 

45. A method according to Claim 44, wherein the 
composition further comprises a cell-uptake component. 

46. An animal comprising an allele that has a seqeunce 
modification according to the method of Claim 44. 
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FIG, IB, 
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FIG. I a 




FIG. ID. 
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FIG. 4. 
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FIG. 5A FIG. 5B. 




FIG lOA. FIG lOB. 
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^ EcoRV-Bd ^ 
fragment from SV40 



FIGURE 6 
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^ EcoRV-Bd ✓ 
fragment from SV40 



FIGURE 7 
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HGURE 12 



SUBSTITUTE SHEET 



wo 93/22443 



12/12 



PCT/US93/03868 




0 500 1000 

Number of Plasmids MicroIn]ected per Cell 



HGUREIS 



SUBSTITUTE SHECT 



INTERNA'nONAL SEARCH REPORT 



No 



PCT/US 93/03868 



L CLASSnCATION OF SUBJECT MATTEK (If smwil dastflctttoo tyntob a^ply, Uiktt« tU>* 



Aoeofilig to InrniTlnail PaM Ouslftafioa (IPC) or to hoib Naitoul CUntflcttloB »»i IPC 

Int.Cl. 5 C12N15/90; C12N5/10; C12Q1/68; 

A61K48/00; A01K67/027 



G01N33/68 



IL FIELDS SEAKCHO) 



Mlalasm DoanMOiioo Sovcborf' 



Clastflatloa SysUB 



CUsifiatioo SymkoU 



Int.Cl. 5 



C12N ; 
AOIK 



A61K ; 



C12Q : 



GOIN 



tOtktElCMBttfalttMh 



S«ftM 

Oocu 



I isdoM la tb« FkUs SoucM* 



m. DOCUMDOS CO^BIDatED TO BE SELCVANT* 



Citoionr* 



IUl«vuttoaalaNA.U 



NUCLEIC ACIDS RESEARCH 

vol. 17. no. 1, 11 January 1989. IRL 

PRESS, OXFORD. UK; 

pages 147 - 161 

B. SAUER AND N. HENDERSON *Cre-st1nulated 
recombination at loxp-contalning DNA 
sequences placed into the mammalian 
genome' 

see page 158. line 9 - page 160. line 25 
SCIENCE 

vol. 251. 15 March 1991. AAAS. 

VASHINGTON.DC.US; 

pages 1351 - 1355 

S. 0*GORMAN ET AL. 'Recombinase-mediated 
gene activation and site specific 
intergration in mammalian cells' 
see page 1351. left column, line 1 - 
middle column, line 8 



1.11.12, 
23.28. 
29.35. 
37.38 



1.11,12. 

23.28. 

29.35.37 



Bti:« 



•SptdMli 

oo^iflNtf to to pvttcBltr nitvuei 
ouUtftaDMttatNkUMnoriftffttoiBtmtlfiail 
flltecteto 



htm 4ocai«t pabUfboA tfttr tk« tnt gn att pftal fiUng iato 
or ■rlortty tato oot to oMfttei with tho tMttcatiM tot 
dtk to ntaitaU tbo priMtylo V thmy mateiyia^ 



Btef 



my thraw tabes OB orioritr dalB(t) or 
9 tnbUsh tb« MbUctfioB dito of OBottor 
rs9oddnueB(uirodM) 

'p* 4eeBm«tnfoRlattaaaonl4iKknB«,Bt%ci±ibMoaor 

"P* tecBMBtyobUikoripriortothol 
laMr ttoa tbo ^rioctiy toto cUlBi 



fUlBgdatobBt 



Vr iOGBIMBt OffUClODltf ntlVIBO^ ^ 

fiir*** to CDBfUffori to IbvdIvo ib Ib wtlvo flop BtoB tto 
taaoBt Is cooibtBod vltb oao or man otfaor ndb iocn- 
MBti, sach coiBblBitloB b^ obvloBS to a p«»B skUM 
iBttotrt 

'Jk* 4ocBaslMflBb«rofttonao9otMtfuUl]r 



IV. cExnncAiioN 



Osto of tbo ActBd Coa^fliOB of tho iBtoiBitioBal SouA 

09 JULY 1993 



Dau of MaUiBf of this iBtmtloBB] Sooreb Boroft 



02 -09- 1993 



IntmttioBBl SoudUag Astbortty 

CUR PEAN PATENT FFICE 



Sli^atart of Aatbofixotf Offlev 

HORNIG H. 



PCT/US 93/03868 

latVBitloaftl Applinrion No 



m. DOCUMENTS CO^I!;Il^faEn TO BE RELEVANT (CONT1NUE9 ntOM THE SECOND SIU£1) 






atHtaa af DocBMBt, wtib IWioUao, vtam iffnpriM, of Ih. mrant patafB 




X 


J. BACTERIOL. 

vol. 172, no. 2. February 1990, AM. SOC. 
MICROBIOL. .BALTIMORE, US; 
pages 610 - 618 

H. MATSUZAKI ET AL. 'Chromosome 
engeneering In Saccharomyces cerevisiae by 
using a site-specific recombination system 
of a yeast plasmid' 

see page 610, left column, line 1 - right 
column, line 18 




1,11.12, 
23,28. 
29,35, 
37,38 


Y 


J. BIOL. CHEM. 

vol. 267, no. 2, 15 January 1992, AM. 
SOC. BIOL. CHEM. INC. US; 
pages 963 - 967 

J.M. WILSON ET AL. 'Hepatocyte-directed 
gene transfer in vivo leads to transient 
improvement of hypercholesteremia in low 
density lipoprotein receptor-deficient 
rabbits' 

cited in the application 

see page 965, right column, line 1 - page 

967, right column, line 12 




6-8,10 


Y 


NATURE 

vol. 350, 21 March 1991, MACMILLAN 

JOURNALS LTD., LONDON, UK; 

pages 243 - 246 , . , 

P. HASTY ET AL. 'Introduction of a subtle 

nutation into the Hox-2.6 locus in 

embryonic stem cells' 

see page 342, right column, line 1 - page 

244, left column, line 9; figures 1-4 


• 


I, 3-5. 

II. 12, 
26-30,35 


Y 


WO, A, 9 117 267 (SRI INTERNATIONAL) 

14 November 1991 

cited in the application 

see page 4, line 5 - line 34; claims 1-18 




1.3-8. 
10-12. 
26-30,35 


P.X 
P.Y 


W0,A,9 208 791 (THE UNITED STATES OF 

AMERICA) 

29 May 1992 

see page 14, line 26 - page 24, line 12; 
claims 1-20 




37,38, 
40.43 

1.3-8, 
10-12, 
26-30,35 


P.X 
P.Y 


WO, A, 9 305 178 (DAIKIN INDUSTRIES) 

18 March 1993 

cited in the application 

see page 29, line 1 - page 34, line 17; 

claims 1-59 




37.38, 
40,43,45 

1.3-8, 
10-12, 
26-30,35 



PCT/US 93/03868 

ArflleulnNe 



1 HLDOOJ] 


MENTS CONSmnfT) TO BEIfXEVANT (OONTINIJED FSOM THE SECOND SHEET) 


aM«iut to Oaia Na 




V0,A,9 220 808 (CELL GENESYS, INC.) 
26 November 1992 

see page 4» line 32 - page 8, line 24 
see page 10, line 15 - page 19, line 3; 
claims 1^17 


I. 3-5, 

II. 12. 
26-30.35 



9mm PCT/XSA/ZIO (flttm mm}{l^uj 1«5) 



INTERNATIONAL SEARCH REPORT 



Imcmaiional applicaiton No. 

PCT/ US 93/ 03868 



Hox I OliNcrvaimns where cerium claim.s were lound unsearchable (C.'oniinuation ciritcm J or first sheet) 



Hii:. iiuciitauoiul xcaiUi icptiii has mil Ihxii c^ublishcJ in rcspcci ol ccruuii d«ums uiuJcr Aiuclv 1 7f2)(a) Tor ihu folluwing reasons: 

1, j XJ Claitn^ Nt«s,: 

' — Uxausc ihcy rclaiL* lu »;ub|cci mailer nui required lo be seaielied by ihts Auihoriiy, namely. 

Remark: Although claims 1-36 as far as they concern an "In vivo" treatment 
and 45,46 are directed to a method of treatment of the human/animal body 
the search has been carried out and based on the alleged effects of the 
compound/composi 1 1 on . 

2. j Claims Nos^ 

because ihey rclau* lu paru of ihe internaUttiiAl applitauun ihai do noi comply wiih ihc prescribed rcquiremcnu to such 
an cxieiit Uiai no meaningrul inicrnauonal search can be carried ouU specifically: 



Claims Nos» 

' heuui^e ihcy arc dcpcndeni claiiiu aiid are nul drafted in atxurdzna* with ihe second and third sentences of Rule 6.4(a). 



Box l] Obscrvaiiuns where unity of invention is lacking; (Continuation or item 2 ofdrst sheet) 



riiui lnu.iiiaiiuiial Scaiduni; AuUuiriiy Tound inulupic tnvcnuons in this inicrnaiional appticaiion. as Totlows: 



I. { j As all icquiicd addiuutial search Tees nverv timely pud by the applicanu this intt:rnauonal search report covers alt 



seal chablc claims. 



2. I j As all seal (.liable claiiiu could be scardics wiiiiout cfTurt justifying an addtitunal fee. this Authority did not invite payment 
of any addtitunal fee. 



3. As only some of iIk required additional search fees were umcly paid by the applicant, this inicrnaiional search report 
covers only those claims for which fees were paid, spcxtficalty claims Nos^ 



4. I I No requticd additional search fees were timely paid by the applicant. Consequently, this tntcrnaiionxl search report is 
resuictcd to the invention first mentioned in the claims; it is covered by claims Noc: 



Remark on l*rotcst | | The additional search fees were accompanied by the applicant's proicsL 

I I No protest accompanied the payment of additional search fees. 



l-urin PCI MSA.: 10 Ktiiiuiiuaimii of fust sheet (1)} (July 



ANNEX TO THE INTERNATIONAL SEARCH REPORT 
ON INTERNATIONAL PATENT APPLICATION NO. US 9303868 

SA 73747 



the D i r iiiwi i Ptmt OBcb ■ ■ — wy irtto hr fbmi paniBilf iiMdi f w i ilj' giwp hr the p m pt m rf inhtwMitQ. 09/07/93 











W0-A-9H7267 


14-11-91 


CA-A- 
EP-A- 
WO-A- 
JP-T- 


2056983 
0481065 
9305178 
4507198 


08-11-91 
22-04-92 
18-03-93 
17-12-92 


WO-A-9208791 


29-05-92 


AU-A- 


9111691 


11-06-92 


WO-A-9305178 


18-03-93 


CA-A- 
EP-A- 
JP-T- 
WO-A- 


2056983 
0481065 
* 4507198 
9117267 


08-11-91 
22-04-92 
17-12-92 
14-11-91 


WO-A-9220808 


26-11-92 


AU-A- 
EP-A- 


2140292 
0539573 


30-12-92 
05-05-93 



I 

i 

O 



PATENT COOPERATION TREATY 



wo 93/22443 • 
PCT/US93/03868 



From the INTERNATIONAL BUREAU 



iff 



PCT 

NOTICE INFORMING THE APPLICANT OF THE 
COMMUNICATION OF THE INTERNATIONAL 
APPLICATION TO THE DESIGNATED OFFICES 

(PCT Rule 47.1(c), first sentence) 


To: 

CLARK, Janet, Pauline 
SRI International 
OtjO navenswooa Mveiiuc 
Menio Park, CA 94025-3493 
ETATS-UNIS D'AMERIGUE 


Date of mailing: 

11 Novennberl993 (11.11.93) 


Applicant's or agent's file reference: 
P-2898 


IMPORTANT NOTICE 


International application No.: International filing date: PrioritY date: 

PCT/US93/03868 23 April 1993 (23.04.93) 24 April 1992 (24.04.92) 


Applicant: 

SRI INTERNATIONAL 



1. Notice is hereby given that the International Bureau has communicated, as provided in Article 20. the International application 
to the following designated Offices on the date indicated above as the date of mailing of this Notice: 



AU,BR,CA,EP,FI,HU,JP,KR,N0,N2 

2. In accordance with Rule 47.1(c). third sentence, each designated Office will accept the present Notice as conclusive evidence 
that the communication of the international application has duly taken place on the date of mailing indicated above and no 
copy of the international application is required to be furnished by the applicant to the designated Offices. 

3. Enclosed with this Notice is a copy of the international application as published by the International Bureau on 



1 1 November 1 993 (1 1 .1 1 .93) under No. WO 93/22443 



REMINDER REGARDING CHAPTER II (Article 31(2)(a) and Rule 54.2) 

If the applicant wishes to postpone entry into the national phase until 30 months (or later in some Offices) from the 
priority date, a demand for international preliminary examination must be filed with the competent International 
Preliminary Examining Authority before the expiration of 19 months from the priority date. 

It is the applicant's sole responsibility to monitor the 19-month time limit. 

Note that only an applicant who is a national or resident of a PCT Contracting State which is bound by Chapter II has 
the right to file a demand for internationi preliminary examination. 

REMINDER REGARDING ENTRY INTO THE NATIONAL PHASE (Article 22 or 39(1)) 

If the applicant wishes to proceed with the international application in the national phase, he must, within 20 months 
or 30 months, or later in some Offices, perform the acts referred to therein before each designated or elected Office. 

For further important information on the time limits and acts to be performed for entering the national phase, see the 
Annex to Form PCT/IB/301 (Notification of Receipt of Record Copy) and Volume II of the PCT Applicant's Guide. 



The International Bureau of WlPO 
34« chemin des Colombettes 
1211 Geneva 20, Switzerland 



Facsimile No.: (41.22)740.14.35 



Authorised officer: 



5[;i im::ii;AiiONAL 
NOV 2 9^ 1393 



J. Zahra 

Telephone No.: (41.22)730.91.11 INTrU.Er.Tl!fl' PonorpTY COUNSEL 



Form PCT/IBy308 (July 1992) 



306162 



